Learning from Few Demonstrations with Frame-Weighted Motion Generation

None, None

Learning from Few Demonstrations with Frame-Weighted Motion Generation

Master Thesis (2022)

Author(s)

J. Sun (TU Delft - Mechanical Engineering)

Contributor(s)

J. Kober – Mentor (TU Delft - Learning & Autonomous Control)

J. Zhu – Mentor (TU Delft - Learning & Autonomous Control)

Michael Gienger – Mentor (Honda Research Institute Europe GmbH)

L. Peternel – Graduation committee member (TU Delft - Human-Robot Interaction)

A.H.A. Stienen – Graduation committee member (TU Delft - Biomechatronics & Human-Machine Control)

Faculty

Mechanical Engineering

Copyright

Data Augmentation Learning from Demonstration Few Demonstrations Frame Weights

To reference this document use:

https://resolver.tudelft.nl/uuid:88639dbd-c0c3-44a9-a4fd-f92548e79bd0

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Graduation Date

22-12-2022

Awarding Institution

Delft University of Technology

Programme

['Mechanical Engineering | Vehicle Engineering | Cognitive Robotics']

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning from Demonstration (LfD) aims to learn versatile skills from human demonstrations. The field has been gaining popularity since it facilitates transferring knowledge to robots without requiring much expert knowledge. During task executions, the robot motion is usually influenced by constraints imposed by environments. In light of this, task-parameterized (TP) learning encodes relevant contextual information in reference frames, enabling better skill generalization to new situations. However, most TP learning algorithms require multiple demonstrations in various environment conditions to ensure sufficient statistics for a meaningful model. It is not a trivial task for robot users to create different situations and perform demonstrations under all of them. Therefore, this paper presents a novel concept to learn motion policy from few demonstrations through explicitly solving reference frame weights along the task trajectory. Experimental results in both simulation and real robotic environments validate our approach.

Files

Thesis_Jianyong.pdf

(pdf | 10.7 Mb)

License info not available