Two-Stage Learning of Highly Dynamic Motions with Rigid and Articulated Soft Quadrupeds

Conference Paper (2024)
Author(s)

F. Vezzi (TU Delft - Learning & Autonomous Control)

Jiatao Ding (TU Delft - Learning & Autonomous Control)

Antonin Raffin (Deutsches Zentrum für Luft- und Raumfahrt (DLR))

Jens Kober (TU Delft - Learning & Autonomous Control)

Cosimo Lieu (TU Delft - Learning & Autonomous Control, Deutsches Zentrum für Luft- und Raumfahrt (DLR))

Research Group
Learning & Autonomous Control
DOI related publication
https://doi.org/10.1109/ICRA57147.2024.10610561
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Learning & Autonomous Control
Pages (from-to)
9720-9726
ISBN (electronic)
979-8-3503-8457-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Controlled execution of dynamic motions in quadrupedal robots, especially those with articulated soft bodies, presents a unique set of challenges that traditional methods struggle to address efficiently. In this study, we tackle these issues by relying on a simple yet effective two-stage learning framework to generate dynamic motions for quadrupedal robots. First, a gradient-free evolution strategy is employed to discover simply represented control policies, eliminating the need for a predefined reference motion. Then, we refine these policies using deep reinforcement learning. Our approach enables the acquisition of complex motions like pronking and back-flipping, effectively from scratch. Additionally, our method simplifies the traditionally labour-intensive task of reward shaping, boosting the efficiency of the learning process. Importantly, our framework proves particularly effective for articulated soft quadrupeds, whose inherent compliance and adaptability make them ideal for dynamic tasks but also introduce unique control challenges.

Files

Two-Stage_Learning_of_Highly_D... (pdf)
(pdf | 1.76 Mb)
- Embargo expired in 08-02-2025
License info not available