Search results | TU Delft Repositories

Searched for: subject%3A%22reinforcement%255C+learning%22

(1 - 7 of 7)

document: Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach
He, K. (author), Shi, S. (author), van den Boom, A.J.J. (author), De Schutter, B.H.K. (author)
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well-known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies...
journal article 2024

document: Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO
Chen, Yangkun (author), Yu, Chenghui (author), Zhu, Hengman (author), Liu, Shuai (author), Zhang, Yibing (author), Suarez, Joseph (author), Zhao, Liang (author), He, J. (author), Chen, Jiaxin (author)
We present the results of the second Neural MMO challenge, hosted at IJCAI 2022, which received 1600+ submissions. This competition targets robustness and generalization in multi-agent systems: participants train teams of agents to complete a multi-task objective against opponents not seen during training. We summarize the competition design...
journal article 2023

document: Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
Suau, M. (author), He, J. (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitation being the amount of data needed and the pace at which that data can be obtained. In this paper, we study how to build lightweight simulators of complicated systems that can run sufficiently fast for...
conference paper 2022

document: Influence-aware memory architectures for deep reinforcement learning in POMDPs
Suau, M. (author), He, J. (author), Congeduti, E. (author), Starre, R.A.N. (author), Czechowski, A.T. (author), Oliehoek, F.A. (author)
Due to its perceptual limitations, an agent may have too little information about the environment to act optimally. In such cases, it is important to keep track of the action-observation history to uncover hidden state information. Recent deep reinforcement learning methods use recurrent neural networks (RNN) to memorize past observations....
journal article 2022

document: Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators
Suau, M. (author), He, J. (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitation being the amount of data needed and the pace at which that data can be obtained. In this paper, we study how to build lightweight simulators of complicated systems that can run sufficiently fast for...
conference paper 2022

document: A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves
Han, Yu (author), Hegyi, A. (author), Zhang, Le (author), He, Zhengbing (author), Chung, Edward (author), Liu, Pan (author)
Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in real traffic process because new control actions are usually explored randomly, which may result in high costs (delays) due to exploration and learning. For this reason, existing RL-based...
journal article 2022

document: A-DDPG: Attention Mechanism-based Deep Reinforcement Learning for NFV
He, Nan (author), Yang, S. (author), Li, Fan (author), Trajanovski, S. (author), Kuipers, F.A. (author), Fu, Xiaoming (author)
The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with different quality of service (QoS) requirements. Given the importance of...
conference paper 2021

Searched for: subject%3A%22reinforcement%255C+learning%22

(1 - 7 of 7)