Search results | TU Delft Repositories

Searched for: collection:ir

(1 - 6 of 6)

document: Online Planning in POMDPs with Self-Improving Simulators
He, J. (author), Suau, M. (author), Baier, Hendrik (author), Kaisers, Michael (author), Oliehoek, F.A. (author)
How can we plan efficiently in a large and complex environment when the time budget is limited? Given the original simulator of the environment, which may be computationally very demanding, we propose to learn online an approximate but much faster simulator that improves over time. To plan reliably and efficiently while the approximate simulator...
conference paper 2022

document: Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
Suau, M. (author), He, J. (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitations are the amount of data needed and the pace at which that data can be obtained. In this paper, we study how to build lightweight simulators of complicated systems that can run sufficiently fast for...
conference paper 2022

document: Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Suau, M. (author), He, J. (author), Çelikok, Mustafa Mert (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Due to its high sample complexity, simulation is, as of today, critical for the successful application of reinforcement learning. Many real-world problems, however, exhibit overly complex dynamics, which makes their full-scale simulation computationally slow. In this paper, we show how to factorize large networked systems of many agents into...
conference paper 2022

document: Influence-aware memory architectures for deep reinforcement learning in POMDPs
Suau, M. (author), He, J. (author), Congeduti, E. (author), Starre, R.A.N. (author), Czechowski, A.T. (author), Oliehoek, F.A. (author)
Due to its perceptual limitations, an agent may have too little information about the environment to act optimally. In such cases, it is important to keep track of the action-observation history to uncover hidden state information. Recent deep reinforcement learning methods use recurrent neural networks (RNN) to memorize past observations....
journal article 2022

document: Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators
Suau, M. (author), He, J. (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitations are the amount of data needed and the pace at which that data can be obtained. In this paper, we study how to build lightweight simulators of complicated systems that can run sufficiently fast for...
conference paper 2022

document: Influence-Augmented Online Planning for Complex Environments
He, J. (author), Suau, M. (author), Oliehoek, F.A. (author)
How can we plan efficiently in real time to control an agent in a complex environment that may involve many other agents? While existing sample-based planners have enjoyed empirical success in large POMDPs, their performance heavily relies on a fast simulator. However, real-world scenarios are complex in nature and their simulators are often...
journal article 2020
