Predictability Awareness For Efficient and Robust Multi-Agent Coordination

None, None

Predictability Awareness For Efficient and Robust Multi-Agent Coordination

Master Thesis (2025)

Author(s)

R. Chiva Gil (TU Delft - Aerospace Engineering)

Contributor(s)

J. Alonso-Mora – Mentor (TU Delft - Learning & Autonomous Control)

Guido C.H.E.de de Croon – Mentor (TU Delft - Control & Simulation)

Daniel Jarne Ornia – Mentor (TU Delft - Learning & Autonomous Control)

K.A. Mustafa – Mentor (TU Delft - Learning & Autonomous Control)

C de Wagter – Graduation committee member (TU Delft - Control & Simulation)

Faculty

Aerospace Engineering

Robotics Coordination Decentralized Control Multi-agent systems Model Predictive Control (MPC) Autonomous Driving Systems

To reference this document use:

https://resolver.tudelft.nl/uuid:c70e148b-a3ac-424d-a633-3d0310538064

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

14-01-2025

Awarding Institution

Delft University of Technology

Programme

['Aerospace Engineering | Control & Operations']

Faculty

Aerospace Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

To safely and efficiently solve motion planning problems in multi-agent settings, most approaches attempt to solve a joint optimization that explicitly accounts for the responses triggered in other
agents. This often results in solutions with an exponential computational complexity, making these methods intractable for complex scenarios with many agents. While sequential predict-and-plan approaches are more scalable, they tend to perform poorly in highly interactive environments. This paper proposes a method to improve the interactive capabilities of sequential predict-and-plan methods in multi-agent navigation problems by introducing predictability as an optimization objective. We interpret predictability through the use of general prediction models, by allowing agents to predict themselves and estimate how they align with these external predictions. We formally introduce this behavior through the free-energy of the system, which reduces (under appropriate bounds) to the Kullback-Leibler divergence between plan and prediction, and use this as a penalty for unpredictable trajectories. The proposed interpretation of predictability allows agents to more robustly leverage prediction models, and fosters a ‘soft social convention' that accelerates agreement on coordination strategies without the need of explicit high level control or communication. We show how this predictability-aware planning leads to lower-cost trajectories and reduces planning effort in a set of multi-robot problems, including autonomous driving experiments with human driver data, where we show that the benefits of considering predictability apply even when only the ego-agent uses this strategy.

Files

MScThesisRomanChiva.pdf

(pdf | 15.7 Mb)

License info not available