Searched for: subject%3A%22inverse%255C%252Breinforcement%255C%252Blearning%22
(1 - 1 of 1)
document
Muench, C. (author), Oliehoek, F.A. (author), Gavrila, D. (author)
Modeling possible future outcomes of robot-human interactions is of importance in the intelligent vehicle and mobile robotics domains. Knowing the reward function that explains the observed behavior of a human agent is advantageous for modeling the behavior with Markov Decision Processes (MDPs). However, learning the rewards that determine...
journal article 2021