Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots

Conference Paper (2022)
Author(s)

Xinglong Zhang (National University of Defense Technology)

Yaoqian Peng (National University of Defense Technology)

Wei Pan (TU Delft - Robot Dynamics)

Xin Xu (National University of Defense Technology)

Haibin Xie (National University of Defense Technology)

Research Group
Robot Dynamics
Copyright
© 2022 Xinglong Zhang, Yaoqian Peng, W. Pan, Xin Xu, Haibin Xie
DOI related publication
https://doi.org/10.1109/ICRA46639.2022.9811604
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Xinglong Zhang, Yaoqian Peng, W. Pan, Xin Xu, Haibin Xie
Research Group
Robot Dynamics
Pages (from-to)
5532-5538
ISBN (print)
978-1-7281-9680-0
ISBN (electronic)
978-1-7281-9681-7
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Distributed model predictive control (DMPC) concerns how to online control multiple robotic systems with constraints effectively. However, the nonlinearity, nonconvexity, and strong interconnections of dynamic system models and constraints can make the real-time and real-world DMPC implementations nontrivial. Reinforcement learning (RL) algorithms are promising for control policy design. However, how to ensure safety in terms of state constraints in RL remains a significant issue. This paper proposes a barrier function-based safe reinforcement learning algorithm for DMPC of nonlinear multi-robot systems under state constraints. The proposed approach is composed of several local learning-based MPC regulators. Each regulator, associated with a local system, learns and deploys the local control policy using a safe reinforcement learning algorithm in a distributed manner, i.e., with state information only among the neighbor agents. As a prominent feature of the proposed algorithm, we present a novel barrier-based policy structure to ensure safety, which has a clear mechanistic interpretation. Both simulated and real-world experiments on the formation control of mobile robots with collision avoidance show the effectiveness of the proposed safe reinforcement learning algorithm for DMPC.

Files

Barrier_Function_based_Safe_Re... (pdf)
(pdf | 0.989 Mb)
- Embargo expired in 01-07-2023
License info not available