Efficient Exploitation of Factored Domains in Bayesian Reinforcement Learning for POMDPs

Conference Paper (2018)
Author(s)

Sammie Katt (Northeastern University)

FA Oliehoek (University of Liverpool, TU Delft - Interactive Intelligence)

Christopher Amato (Northeastern University)

Research Group
Interactive Intelligence
Copyright
© 2018 Sammie Katt, F.A. Oliehoek, Christopher Amato
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 Sammie Katt, F.A. Oliehoek, Christopher Amato
Research Group
Interactive Intelligence
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

While the POMDP has proven to be a powerful framework to model and solve partially observable stochastic problems, it assumes ac- curate and complete knowledge of the environment. When such information is not available, as is the case in many real world appli- cations, one must learn such a model. The BA-POMDP considers the model as part of the hidden state and explicitly considers the uncertainty over it, and as a result transforms the learning problem into a planning problem. This model, however, grows exponentially with the underlying POMDP size, and becomes intractable for non- trivial problems. In this article we propose a factored framework, the FBA-POMDP that represents the model as a Bayes-Net, dras- tically decreasing the number of parameters required to describe the dynamics of the environment. We demonstrate that the our ap- proach allows solvers to tackle problems much larger than possible in the BA-POMDP.

Files

License info not available