Efficient Exploitation of Factored Domains in Bayesian Reinforcement Learning for POMDPs

None, None; None, None; None, None

Efficient Exploitation of Factored Domains in Bayesian Reinforcement Learning for POMDPs

Conference Paper (2018)

Author(s)

Sammie Katt (Northeastern University)

Frans A. Oliehoek (University of Liverpool, TU Delft - Interactive Intelligence)

Christopher Amato (Northeastern University)

Research Group

Interactive Intelligence

Copyright

Refereed, workshop

To reference this document use:

https://resolver.tudelft.nl/uuid:995e44c1-b9b1-4f15-ae3c-09bcb51207c9

More Info

expand_more

Publication Year

2018

Language

English

Copyright

Research Group

Interactive Intelligence

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

While the POMDP has proven to be a powerful framework to model and solve partially observable stochastic problems, it assumes ac- curate and complete knowledge of the environment. When such information is not available, as is the case in many real world appli- cations, one must learn such a model. The BA-POMDP considers the model as part of the hidden state and explicitly considers the uncertainty over it, and as a result transforms the learning problem into a planning problem. This model, however, grows exponentially with the underlying POMDP size, and becomes intractable for non- trivial problems. In this article we propose a factored framework, the FBA-POMDP that represents the model as a Bayes-Net, dras- tically decreasing the number of parameters required to describe the dynamics of the environment. We demonstrate that the our ap- proach allows solvers to tackle problems much larger than possible in the BA-POMDP.

Files

ALA_2018_paper_49_1.pdf

(pdf | 1 Mb)

License info not available