Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots

None, None; None, None; None, None; None, None; None, None

Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots

Conference Paper (2022)

Author(s)

Xinglong Zhang (National University of Defense Technology)

Yaoqian Peng (National University of Defense Technology)

Wei Pan (TU Delft - Robot Dynamics)

Xin Xu (National University of Defense Technology)

Haibin Xie (National University of Defense Technology)

Research Group

Robot Dynamics

Copyright

DOI related publication

https://doi.org/10.1109/ICRA46639.2022.9811604

Safety Reinforcement learning Heuristic algorithms Formation control Mobile robots Prediction algorithms Regulators

To reference this document use:

https://resolver.tudelft.nl/uuid:f4e1f94d-0ec8-4f4a-82be-07b1cd2385e2

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Research Group

Robot Dynamics

Pages (from-to)

5532-5538

ISBN (print)

978-1-7281-9680-0

ISBN (electronic)

978-1-7281-9681-7

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Distributed model predictive control (DMPC) concerns how to online control multiple robotic systems with constraints effectively. However, the nonlinearity, nonconvexity, and strong interconnections of dynamic system models and constraints can make the real-time and real-world DMPC implementations nontrivial. Reinforcement learning (RL) algorithms are promising for control policy design. However, how to ensure safety in terms of state constraints in RL remains a significant issue. This paper proposes a barrier function-based safe reinforcement learning algorithm for DMPC of nonlinear multi-robot systems under state constraints. The proposed approach is composed of several local learning-based MPC regulators. Each regulator, associated with a local system, learns and deploys the local control policy using a safe reinforcement learning algorithm in a distributed manner, i.e., with state information only among the neighbor agents. As a prominent feature of the proposed algorithm, we present a novel barrier-based policy structure to ensure safety, which has a clear mechanistic interpretation. Both simulated and real-world experiments on the formation control of mobile robots with collision avoidance show the effectiveness of the proposed safe reinforcement learning algorithm for DMPC.

Files

Barrier_Function_based_Safe_Re... (pdf)

(pdf | 0.989 Mb)

- Embargo expired in 01-07-2023

License info not available