Safe and Adaptive 3-D Locomotion via Constrained Task-Space Imitation Learning

Journal article (2023)

Authors

J. Ding Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

Tin Lun Lam Chinese University of Hong Kong, Shenzhen Institute of Artificial Intelligence and Robotics for Society

Ligang Ge Ubtech Robotics Corporation

Jianxin Pang Ubtech Robotics Corporation

Yanlong Huang University of Leeds

Research Group

Learning & Autonomous Control (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI

https://doi.org/10.1109/TMECH.2023.3239099

Safety Bipedal locomotion Robots Lips Passive safety Humanoid robot Trajectory Legged locomotion Task analysis Solid modeling 3-D walking Constrained imitation learning

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:528e1625-efd1-4d40-a802-4a9c704d1880

Published Date

2023

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Learning & Autonomous Control

Abstract

Bipedal locomotion has been widely studied in recent years, where passive safety (i.e., a biped rapidly brakes without falling) is deemed to be a pivotal problem. To realize safe 3-D walking, existing works resort to nonlinear optimization techniques based on simplified dynamics models, requiring hand-tuned reference trajectories. In this article, we propose to integrate safety constraints into constrained task-space imitation learning, endowing a humanoid robot with adaptive walking capability. Specifically, unlike previous work using nonlinear and coupled capturability dynamics, we first linearize the 3-D capture conditions using appropriate extreme values and then seamlessly incorporate them into constrained imitation learning. Furthermore, we propose novel heuristic rules to define control points, enabling adaptive locomotion learning. The resulting framework allows robots to learn locomotion skills from a few demonstrations efficiently and apply the learned skills to unseen 3-D scenarios while satisfying the constraints for passive safety. Unlike deep enforcement learning, our framework avoids the need of a large number of iterations or sim-to-real transfer. By virtue of the task-space adaptability, the proposed imitation learning framework can reuse collected demonstrations in a new robot platform. We validate our method by hardware experiments on Walker2 robot and simulations on COMAN robot.

Files

Safe_and_Adaptive_3_D_Locomoti... (.pdf)

(.pdf | 3.16 Mb)