Accelerated Vector Pruning for Optimal POMDP Solvers

Conference Paper (2017)
Author(s)

Erwin Walraven (TU Delft - Algorithmics)

Matthijs Spaan (TU Delft - Algorithmics)

Research Group
Algorithmics
Copyright
© 2017 E.M.P. Walraven, M.T.J. Spaan
More Info
expand_more
Publication Year
2017
Language
English
Copyright
© 2017 E.M.P. Walraven, M.T.J. Spaan
Research Group
Algorithmics
Pages (from-to)
3672-3678
ISBN (electronic)
978-1577357803
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Partially Observable Markov Decision Processes (POMDPs) are powerful models for planning under uncertainty in partially observable domains. However, computing optimal solutions for POMDPs is challenging because of the high computational requirements of POMDP solution algorithms. Several algorithms use a subroutine to prune dominated vectors in value functions, which requires a large number of linear programs (LPs) to be solved and it represents a large part of the total running time. In this paper we show how the LPs in POMDP pruning subroutines can be decomposed using a Benders decomposition. The resulting algorithm incrementally adds LP constraints and uses only a small fraction of the constraints. Our algorithm significantly improves the performance of existing pruning methods and the commonly used incremental pruning algorithm. Our new variant of incremental pruning is the fastest optimal pruning-based POMDP algorithm.

Files

Walraven17aaai.pdf
(pdf | 0.53 Mb)
License info not available