Learning What to Attend to: Using Bisimulation Metrics to Explore and Improve Upon What a Deep Reinforcement Learning Agent Learns
Abstract
Recent years have seen a surge of algorithms and architectures for deep Reinforcement Learning (RL), many of which have shown remarkable success on various problems. Yet, little work has attempted to relate the performance of these algorithms and architectures to what the resulting deep RL agents actually learn, and whether this corresponds to what they should ideally learn. Such a comparison may allow for both an improved understanding of why certain algorithms or network architectures perform better than others and the development of methods that specifically address discrepancies between what is and what should be learned.
Files
Albers20BNAICBenelearn.pdf (PDF, 0.323 MB)