TamedPUMA

safe and stable imitation learning with geometric fabrics

Journal Article (2025)
Author(s)

S. Bakker (TU Delft - Learning & Autonomous Control)

Rodrigo Pérez-Dattari (TU Delft - Learning & Autonomous Control)

C. Della Santina (TU Delft - Learning & Autonomous Control)

J.W. Böhmer (TU Delft - Sequential Decision Making)

J. Alonso-Mora (TU Delft - Learning & Autonomous Control)

Research Group
Learning & Autonomous Control
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Learning & Autonomous Control
Volume number
283
Pages (from-to)
405-418
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Using the language of dynamical systems, Imitation learning (IL) provides an intuitive and effective way of teaching stable task-space motions to robots with goal convergence. Yet, IL techniques are affected by serious limitations when it comes to ensuring safety and fulfillment of physical constraints. With this work, we solve this challenge via TamedPUMA, an IL algorithm augmented with a recent development in motion generation called geometric fabrics. As both the IL policy and geometric fabrics describe motions as artificial second-order dynamical systems, we propose two variations where IL provides a navigation policy for geometric fabrics. The result is a stable imitation learning strategy within which we can seamlessly blend geometrical constraints like collision avoidance and joint limits. Beyond providing a theoretical analysis, we demonstrate TamedPUMA with simulated and real-world tasks, including a 7-DoF manipulator.

Files

Bakker25a.pdf
(pdf | 9.53 Mb)
License info not available