Safe and Adaptive 3-D Locomotion via Constrained Task-Space Imitation Learning
J. Ding (TU Delft - Learning & Autonomous Control)
Tin Lun Lam (Chinese University of Hong Kong, Shenzhen Institute of Artificial Intelligence and Robotics for Society)
Ligang Ge (Ubtech Robotics Corporation)
Jianxin Pang (Ubtech Robotics Corporation)
Yanlong Huang (University of Leeds)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Bipedal locomotion has been widely studied in recent years, where passive safety (i.e., a biped rapidly brakes without falling) is deemed to be a pivotal problem. To realize safe 3-D walking, existing works resort to nonlinear optimization techniques based on simplified dynamics models, requiring hand-tuned reference trajectories. In this article, we propose to integrate safety constraints into constrained task-space imitation learning, endowing a humanoid robot with adaptive walking capability. Specifically, unlike previous work using nonlinear and coupled capturability dynamics, we first linearize the 3-D capture conditions using appropriate extreme values and then seamlessly incorporate them into constrained imitation learning. Furthermore, we propose novel heuristic rules to define control points, enabling adaptive locomotion learning. The resulting framework allows robots to learn locomotion skills from a few demonstrations efficiently and apply the learned skills to unseen 3-D scenarios while satisfying the constraints for passive safety. Unlike deep enforcement learning, our framework avoids the need of a large number of iterations or sim-to-real transfer. By virtue of the task-space adaptability, the proposed imitation learning framework can reuse collected demonstrations in a new robot platform. We validate our method by hardware experiments on Walker2 robot and simulations on COMAN robot.