Interactive Learning in State-space

Enabling robots to learn from non-expert humans


Abstract

Imitation Learning is a technique for programming the behavior of agents through demonstration, as opposed to manually engineering that behavior. However, Imitation Learning methods require demonstration data (in the form of state-action labels), and in many scenarios such demonstrations are expensive to obtain or too complex for a demonstrator to execute. This scarcity or sub-optimality of demonstrations limits the applicability and performance of many Imitation Learning methods.
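To make the state-action label requirement concrete, the sketch below shows generic behavior cloning: a small policy network fit to demonstrated (state, action) pairs by supervised regression. This is an illustrative Python/PyTorch example with assumed dimensions and names, not the implementation used in the thesis.

```python
# Minimal behavior-cloning sketch: fit a policy to demonstrated
# (state, action) labels by supervised regression. Dimensions are assumed.
import torch
import torch.nn as nn

state_dim, action_dim = 4, 1          # e.g. a CartPole-like task (assumed)
policy = nn.Sequential(
    nn.Linear(state_dim, 64), nn.ReLU(),
    nn.Linear(64, action_dim),
)
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

def behavior_cloning_step(states, actions):
    """One supervised update on a batch of demonstrated state-action labels."""
    pred = policy(states)                         # actions the policy would take
    loss = nn.functional.mse_loss(pred, actions)  # match the demonstrator's actions
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Example call with a placeholder batch; real data would come from a demonstrator.
demo_states = torch.randn(32, state_dim)
demo_actions = torch.randn(32, action_dim)
behavior_cloning_step(demo_states, demo_actions)
```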

Advancements in Interactive Imitation Learning techniques, however, have made it easier for demonstrators to train agents and improve their performance. In these techniques, demonstrators interact with and guide the agent as it performs the required task. This guidance typically takes the form of corrections or feedback on the actions currently being executed by the agent.

In this thesis, a novel Interactive Learning technique is proposed that uses human corrective feedback in state-space to train and improve agent behavior. This technique is beneficial because providing guidance in terms of "changing the agent's state" is often easier and more intuitive for the human demonstrator than changing the actions being executed. For instance, in manipulation tasks with a robotic arm, it is easier for the demonstrator to provide state information, such as the Cartesian position of the end-effector, than low-level action information, such as joint angles. With such scenarios in mind, we propose our method, Teaching Imitative Policies in State-space (TIPS).
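As a rough illustration of the idea (not the exact TIPS algorithm), the sketch below shows one way a state-space correction could be turned into an action label for the policy: the human's feedback defines a desired next state, and an assumed inverse dynamics model maps the current and desired states to an action. The function names, the step_size parameter, and the stand-in linear model are all hypothetical.

```python
# Hedged sketch: convert human feedback given in state-space into an action label.
# Assumptions (not taken from the abstract): the correction is a desired change
# `delta_s` in the observed state, and an approximate inverse dynamics model
# inverse_dynamics(s, s_desired) -> a is available, e.g. learned from transitions.
import numpy as np

def action_from_state_feedback(state, delta_s, inverse_dynamics, step_size=0.1):
    """Map a state-space correction to an action label for policy training."""
    desired_state = state + step_size * delta_s   # where the human wants the agent to go
    action_label = inverse_dynamics(state, desired_state)
    return desired_state, action_label

# Illustrative use with a stand-in linear model s' = A s + B a (purely hypothetical).
A = np.eye(4)
B = np.ones((4, 1))
inv_dyn = lambda s, s_next: np.linalg.pinv(B) @ (s_next - A @ s)

s = np.zeros(4)
feedback = np.array([0.0, 1.0, 0.0, 0.0])         # e.g. "increase the second state dimension"
s_des, a = action_from_state_feedback(s, feedback, inv_dyn)
```

The resulting (state, action label) pairs could then be used to update the policy with the same supervised objective as in the behavior-cloning sketch above.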

We evaluate the performance of TIPS on various control tasks from the OpenAI Gym toolkit as well as on a manipulation task using a KUKA LBR iiwa robotic arm. We show that, through continuous improvement via feedback, agents trained using TIPS outperform the demonstrator and, in turn, outperform conventional Imitation Learning agents.