Where to go next

Learning a Subgoal Recommendation Policy for Navigation in Dynamic Environments

Journal article (2021)

Authors

B.F. Ferreira de Brito Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

Michael Everett Massachusetts Institute of Technology

Jonathan Patrick How Massachusetts Institute of Technology

J. Alonso-Mora Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

Research Group

Learning & Autonomous Control (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI

https://doi.org/10.1109/LRA.2021.3068662

Navigation Deep Reinforcement Learning Robots Planning Collision avoidance Vehicle dynamics Training Robot kinematics Motion and Path Planning in Dynamic Environments or for Multi-robot Systems

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:9d6316ec-035d-44bc-8d76-85c8a6c74846

Published Date

2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Learning & Autonomous Control

Abstract

Robotic navigation in environments shared with other robots or humans remains challenging because the intentions of the surrounding agents are not directly observable and the environment conditions are continuously changing. Local trajectory optimization methods, such as model predictive control (MPC), can deal with those changes but require global guidance, which is not trivial to obtain in crowded scenarios. This paper proposes to learn, via deep Reinforcement Learning (RL), an interaction-aware policy that provides long-term guidance to the local planner. In particular, in simulations with cooperative and non-cooperative agents, we train a deep network to recommend a subgoal for the MPC planner. The recommended subgoal is expected to help the robot in making progress towards its goal and accounts for the expected interaction with other agents. Based on the recommended subgoal, the MPC planner then optimizes the inputs for the robot satisfying its kinodynamic and collision avoidance constraints. Our approach is shown to substantially improve the navigation performance in terms of number of collisions as compared to prior MPC frameworks, and in terms of both travel time and number of collisions compared to deep RL methods in cooperative, competitive and mixed multiagent scenarios.

Files

09385847.pdf

(.pdf | 1.62 Mb)

Where_to_go_Next_Learning_a_Su... (.pdf)

(.pdf | 1.41 Mb)