A.T. Czechowski | TU Delft Repository

Non-chaotic limit sets in multi-agent learning

Journal article (2023) - A.T. Czechowski (author) , Georgios Piliouras (author)

Non-convergence is an inherent aspect of adaptive multi-agent systems, and even basic learning models, such as the replicator dynamics, are not guaranteed to equilibriate. Limit cycles, and even more complicated chaotic sets are in fact possible even in rather simple games, inclu ...

Safe Multi-agent Learning via Trapping Regions

Conference paper (2023) - A.T. Czechowski (author) , Frans A Oliehoek (author)

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to m ...

Safety Guarantees in Multi-agent Learning via Trapping Regions

Journal article (2023) - A.T. Czechowski (author) , Frans A Oliehoek (author)

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to m ...

Poincaré-Bendixson Limit Sets in Multi-Agent Learning

Conference paper (2022) - A.T. Czechowski (author) , Georgios Piliouras (author)

A key challenge of evolutionary game theory and multi-agent learning is to characterize the limit behavior of game dynamics. Whereas convergence is often a property of learning algorithms in games satisfying a particular reward structure (e.g., zero-sum games), even basic learnin ...

Multi Robot Surveillance and Planning in Limited Communication Environments

Conference paper (2022) - V. Inna Kedege (author) , Aleksander Czechowski (author) , Ludo Stellingwerff (author) , Frans A Oliehoek (author)

Distributed robots that survey and assist with search & rescue operations usually deal with unknown environments with limited communication. This paper focuses on distributed & cooperative multi-robot area coverage strategies of unknown environments, having constrained co ...

Influence-aware memory architectures for deep reinforcement learning in POMDPs

Journal article (2022) - Miguel Suau de Castro (author) , J. He (author) , E. Congeduti (author) , R.A.N. Starre (author) , Aleksander Czechowski (author) , FA Oliehoek (author)

Due to its perceptual limitations, an agent may have too little information about the environment to act optimally. In such cases, it is important to keep track of the action-observation history to uncover hidden state information. Recent deep reinforcement learning methods use r ...

Constraint Propagation and Reverse Multi-Agent Learning

Conference paper (2021) - A.T. Czechowski (author)

The development of multi-agent reinforcement learning has been largely driven by the question of how to design learning algorithms to reach some particular notion of optimality of strategies, e.g. Nash equilibria. The set of optimal strategies is not known before the execution of ...

Exploring the Effects of Conditioning Independent Q-Learners on the Sufficient Statistic for Dec-POMDPs

Conference paper (2020) - A.V. Mandersloot (author) , Frans Oliehoek (author) , Aleksander Czechowski (author)

In this study, we investigate the effects of conditioning Independent Q-Learners (IQL) not solely on the individual action-observation history, but additionally on the sufficient plan-time statistic for Decentralized Partially Observable Markov Decision Processes. In doing so, we ...

Alternating Maximization with Behavioral Cloning

Conference paper (2020) - A.T. Czechowski (author) , Frans A Oliehoek (author)

The key difficulty of cooperative, decentralized planning lies in making accurate predictions about the behavior of one’s teammates. In this paper we introduce a planning method of Alternating maximization with Behavioural Cloning (ABC) – a trainable online decentralized planning ...

Decentralized MCTS via Learned Teammate Models

Conference paper (2020) - A.T. Czechowski (author) , Frans A Oliehoek (author)

Decentralized online planning can be an attractive paradigm for cooperative multi-agent systems, due to improved scalability and robustness. A key difficulty of such approach lies in making accurate predictions about the decisions of other agents. In this paper, we present a trai ...

Influence-Based Abstraction in Deep Reinforcement Learning

Conference paper (2019) - M. Suau de Castro (author) , Elena Congeduti (author) , Rolf Starre (author) , Aleksander Czechowski (author) , F.A. Oliehoek (author)

thousands, or even millions of state variables. Unfortunately, applying reinforcement learning algorithms to handle complex tasks becomes more and more challenging as the number of state variables increases. In this paper, we build on the concept of influence-based abstraction wh ...

ICA based on Split Generalized Gaussian

Journal article (2019) - Przemyslaw Spurek (author) , Przemys law Rola (author) , Jacek Tabor (author) , Aleksander Czechowski (author) , Andrzej Bedychaj (author)

Independent Component Analysis (ICA) is a method for searching the linear transformation that minimizes the statistical dependence between its components. Most popular ICA methods use kurtosis as a metric of independence (non-Gaussianity) to maximize, such as FastICA and JADE. Ho ...