Safety Guarantees in Multi-agent Learning via Trapping Regions

None, None; None, None

Safety Guarantees in Multi-agent Learning via Trapping Regions

Journal Article (2023)

Author(s)

A.T. Czechowski (TU Delft - Interactive Intelligence)

F.A. Oliehoek (TU Delft - Interactive Intelligence)

Research Group

Interactive Intelligence

Learning dynamics Multi-agent learning Safety sets

To reference this document use:

https://resolver.tudelft.nl/uuid:9870f34c-455a-4481-be42-7bf0de64aed4

More Info

expand_more

Publication Year

2023

Language

English

Research Group

Interactive Intelligence

Volume number

2023-May

Pages (from-to)

2403-2405

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to most single-agent environments, and sets a prohibitive barrier for deployment in practical applications, as it induces uncertainty in long term behavior of the system. In this work, we propose to apply the concept of trapping regions, known from qualitative theory of dynamical systems, to create safety sets in the joint strategy space for decentralized learning. Upon verification of the direction of learning dynamics, the resulting trajectories are guaranteed not to escape such sets, during the learning process. As a result, it is ensured, that despite the uncertainty over convergence of the applied algorithms, learning will never form hazardous joint strategy combinations.

Files

3545946.3598948.pdf

(pdf | 1.18 Mb)

- Embargo expired in 27-11-2023

License info not available