Learning to Play Trajectory Games Against Opponents with Unknown Objectives

Journal Article (2023)
Author(s)

Xinjie Liu (Student TU Delft)

L. Peters (TU Delft - Learning & Autonomous Control)

J. Alonso-Mora (TU Delft - Learning & Autonomous Control)

Research Group
Learning & Autonomous Control
Copyright
© 2023 Xinjie Liu, L. Peters, J. Alonso-Mora
DOI related publication
https://doi.org/10.1109/LRA.2023.3280809
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Xinjie Liu, L. Peters, J. Alonso-Mora
Research Group
Learning & Autonomous Control
Issue number
7
Volume number
8
Pages (from-to)
4139-4146
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.

Files

Learning_to_Play_Trajectory_Ga... (pdf)
(pdf | 2.9 Mb)
- Embargo expired in 29-11-2023
License info not available