Safety Guarantees in Multi-agent Learning via Trapping Regions

Journal Article (2023)
Author(s)

Aleksander Czechowski (TU Delft - Interactive Intelligence)

F.A. Oliehoek (TU Delft - Interactive Intelligence)

Research Group
Interactive Intelligence
Copyright
© 2023 A.T. Czechowski, F.A. Oliehoek
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 A.T. Czechowski, F.A. Oliehoek
Research Group
Interactive Intelligence
Volume number
2023-May
Pages (from-to)
2403-2405
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to most single-agent environments, and sets a prohibitive barrier for deployment in practical applications, as it induces uncertainty in long term behavior of the system. In this work, we propose to apply the concept of trapping regions, known from qualitative theory of dynamical systems, to create safety sets in the joint strategy space for decentralized learning. Upon verification of the direction of learning dynamics, the resulting trajectories are guaranteed not to escape such sets, during the learning process. As a result, it is ensured, that despite the uncertainty over convergence of the applied algorithms, learning will never form hazardous joint strategy combinations.

Files

3545946.3598948.pdf
(pdf | 1.18 Mb)
- Embargo expired in 27-11-2023
License info not available