Learning from Few Demonstrations with Frame-Weighted Motion Generation

Master Thesis (2022)
Author(s)

J. Sun (TU Delft - Mechanical Engineering)

Contributor(s)

J. Kober – Mentor (TU Delft - Learning & Autonomous Control)

J. Zhu – Mentor (TU Delft - Learning & Autonomous Control)

Michael Gienger – Mentor (Honda Research Institute Europe GmbH)

L. Peternel – Graduation committee member (TU Delft - Human-Robot Interaction)

Arno H.A. Stienen – Graduation committee member (TU Delft - Biomechatronics & Human-Machine Control)

Faculty
Mechanical Engineering
Copyright
© 2022 Jianyong Sun
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Jianyong Sun
Graduation Date
22-12-2022
Awarding Institution
Delft University of Technology
Programme
['Mechanical Engineering | Vehicle Engineering | Cognitive Robotics']
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning from Demonstration (LfD) aims to learn versatile skills from human demonstrations. The field has been gaining popularity since it facilitates transferring knowledge to robots without requiring much expert knowledge. During task executions, the robot motion is usually influenced by constraints imposed by environments. In light of this, task-parameterized (TP) learning encodes relevant contextual information in reference frames, enabling better skill generalization to new situations. However, most TP learning algorithms require multiple demonstrations in various environment conditions to ensure sufficient statistics for a meaningful model. It is not a trivial task for robot users to create different situations and perform demonstrations under all of them. Therefore, this paper presents a novel concept to learn motion policy from few demonstrations through explicitly solving reference frame weights along the task trajectory. Experimental results in both simulation and real robotic environments validate our approach.

Files

Thesis_Jianyong.pdf
(pdf | 10.7 Mb)
License info not available