Learning Multi-Reference Frame Skills from Demonstration with Task-Parameterized Gaussian Processes
M. Ramirez Montero (TU Delft - Learning & Autonomous Control)
G. Franzese (TU Delft - Learning & Autonomous Control)
J. Kober (TU Delft - Learning & Autonomous Control)
C. Della Santina (TU Delft - Learning & Autonomous Control)
Abstract
A central challenge in Learning from Demonstration is to generate representations that are adaptable and generalize to unseen situations. This work proposes to learn such a representation, without relying on task-specific heuristics, in the context of multi-reference-frame skill learning by superimposing local skills in the global frame. Local policies are first learned by fitting the skills relative to each frame using Gaussian Processes (GPs). Then another GP, which determines the relevance of each frame at every time step, is trained in a self-supervised manner on a separate batch of demonstrations. The uncertainty quantification capability of GPs is exploited to stabilize the local policies and to train the frame relevance in a fully Bayesian way. We validate the method on a dataset of multi-frame tasks generated in simulation and in real-world experiments with a robotic pick-and-place re-shelving task. We evaluate performance with two metrics: how close the generated trajectories get to each task goal, and how much they deviate from held-out expert trajectories. On both metrics, the proposed method consistently outperforms the state-of-the-art baseline, the Task-Parameterized Gaussian Mixture Model (TPGMM).
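To make the core idea concrete, below is a minimal sketch (not the authors' implementation) of the superposition step: one GP is fit per reference frame on a demonstration expressed in that frame, and the frame-local predictions are recombined in the global frame with per-timestep relevance weights. All names and data here are hypothetical; the relevance weights are hand-coded for illustration, whereas the method summarized above learns them with a second GP trained in a self-supervised, Bayesian manner.

```python
# Minimal sketch of multi-reference-frame skill superposition with GPs.
# Illustrative only: names and data are hypothetical, and the learned
# frame-relevance GP from the paper is replaced by fixed weights.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)
T = 50
t = np.linspace(0.0, 1.0, T).reshape(-1, 1)            # time input

# Two reference frames (e.g. pick and place poses), each a 2-D origin.
frame_origins = [np.array([0.0, 0.0]), np.array([1.0, 0.5])]

# Demonstrated global trajectory: moves from frame 0 toward frame 1.
demo = (1.0 - t) * frame_origins[0] + t * frame_origins[1]
demo += 0.01 * rng.standard_normal(demo.shape)

# Local policies: one GP per frame, fit on the demonstration
# expressed relative to that frame's origin.
kernel = RBF(length_scale=0.2) + WhiteKernel(noise_level=1e-4)
local_gps = []
for origin in frame_origins:
    gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
    gp.fit(t, demo - origin)                           # frame-local targets
    local_gps.append(gp)

# Per-timestep frame relevance. The paper trains these weights with
# another GP in a self-supervised, Bayesian way; here we simply assume
# a smooth hand-off from frame 0 to frame 1.
alpha = np.column_stack([1.0 - t.ravel(), t.ravel()])  # shape (T, 2)

# Superimpose the local predictions in the global frame. Each GP also
# returns a predictive std, the uncertainty the paper exploits to
# stabilize the local policies.
global_traj = np.zeros_like(demo)
for k, (gp, origin) in enumerate(zip(local_gps, frame_origins)):
    mean, std = gp.predict(t, return_std=True)
    global_traj += alpha[:, [k]] * (mean + origin)

print("reproduction error:", np.abs(global_traj - demo).max())
```

In this toy setting the weights interpolate linearly between the two frames, so the superposed trajectory closely reproduces the demonstration; the value of learning the relevance weights, as the paper does, is that the same local GPs then generalize when the frames move to unseen configurations.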