Curriculum-Based Reinforcement Learning for Quadrupedal Jumping

None, None; None, None; None, None; None, None; None, None

Curriculum-Based Reinforcement Learning for Quadrupedal Jumping

A Reference-Free Design

Journal Article (2025)

Author(s)

Vassil Atanassov (University of Oxford)

J. Ding (TU Delft - Learning & Autonomous Control)

J. Kober (TU Delft - Learning & Autonomous Control)

Ioannis Havoutis (University of Oxford)

C. Della Santina (TU Delft - Learning & Autonomous Control, Deutsches Zentrum für Luft- und Raumfahrt (DLR))

Research Group

Learning & Autonomous Control

DOI related publication

https://doi.org/10.1109/MRA.2024.3487325

To reference this document use:

https://resolver.tudelft.nl/uuid:6492dd6a-6d68-4b27-aaa4-b524d8e40d73

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Learning & Autonomous Control

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository as part of the Taverne amendment. More information about this copyright law amendment can be found at https://www.openaccess.nl. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en

Issue number

2

Volume number

32

Pages (from-to)

35-48

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep reinforcement learning (DRL) has emerged as a promising solution to mastering explosive and versatile quadrupedal jumping skills. However, current DRL-based frameworks usually rely on pre-existing reference trajectories obtained by capturing animal motions or transferring experience from existing controllers. This work aims to prove that learning dynamic jumping is possible without relying on imitating a reference trajectory by leveraging a curriculum design. Starting from a vertical in-place jump, we generalize the learned policy to forward and diagonal jumps and, finally, we learn to jump across obstacles. Conditioned on the desired landing location, orientation, and obstacle dimensions, the proposed approach yields a wide range of omnidirectional jumping motions in real-world experiments. In particular, we achieve a 90 cm forward jump, exceeding all previous records for similar robots. Additionally, the robot can reliably execute continuous jumping on soft grassy grounds, which is especially remarkable as such conditions were not included in the training stage.

Files

Curriculum-Based_Reinforcement... (pdf)

(pdf | 5.2 Mb)

- Embargo expired in 20-05-2025

License info not available