Learning to Play Trajectory Games Against Opponents with Unknown Objectives

None, None; None, None; None, None

Learning to Play Trajectory Games Against Opponents with Unknown Objectives

Journal Article (2023)

Author(s)

Xinjie Liu (Student TU Delft)

Lasse Peters (TU Delft - Mechanical Engineering)

Javier Alonso-Mora (TU Delft - Mechanical Engineering)

Research Group

Learning & Autonomous Control

Optimization Robots Games Planning Collision avoidance Multi-robot systems Maximum likelihood estimation Trajectory games Trajectory Human-aware motion planning Integrated planning and learning

DOI related publication

https://doi.org/10.1109/LRA.2023.3280809 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:fafde1ec-a419-4ad7-817e-d8a4e384ca6c

More Info

expand_more

Publication Year

2023

Language

English

Research Group

Learning & Autonomous Control

Issue number

7

Volume number

8

Pages (from-to)

4139-4146

Downloads counter

221

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.

Files

Learning_to_Play_Trajectory_Ga... (pdf)

(pdf | 2.9 Mb)

- Embargo expired in 29-11-2023

License info not available