Search results | TU Delft Repositories

document

Models and heuristics for hard routing and knapsack problems

Pierotti, J. (author)

One of the world’s biggest challenges is that living beings have to share a limited amount of resources. As people of science, we strive to find innovative ways to better use these resources, to reach and positively affect more and more people. In the field of optimization, we aim at finding an optimal allocation of limited sets of resources to...

doctoral thesis 2022

document

The Feasibility of Reinforcement Learning as a replacement for Mixed-Integer Programming for Smart Charging of Electric Vehicles

Becker, Simion (author)

As the world is currently actively trying to reduce the consumption of fossil fuels, large investments are done in renewable energy sources and ways are sought after to electrify fossil fuel-intensive sectors. In line with these developments, the number of electric vehicles requiring access to the electric power grid has exploded putting...

master thesis 2022

document

Offline-Online Reoptimization using a Hybrid Method

de Vringer, Stefan (author)

Realistic vehicle routing problems have been highly relevant for years in a wide variety of domains. One such domain is food delivery, where well-crafted routes can reduce costs and contribute to customer satisfaction. This thesis formulates a problem variant for the restaurant meal delivery problem in order to examine the reoptimization of meal...

master thesis 2022

document

Combining MPC and reinforcement learning in a model-reference framework for urban traffic signal control

Remmerswaal, Willemijn (author)

Both model predictive control (MPC) and reinforcement learning (RL) have shown promising results in the control of traffic signals in urban traffic networks. There are, however, a few drawbacks. MPC controllers are not adaptive and therefore perform suboptimal in the presence of the uncertainties that always occur in urban traffic systems....

master thesis 2022

document

REAL Reinforcement Learning: Planning with adversarial models

Foffano, Daniele (author)

Model-Based Reinforcement Learning (MBRL) algorithms solve sequential decision-making problems, usually formalised as Markov Decision Processes, using a model of the environment dynamics to compute the optimal policy. When dealing with complex environments, the environment dynamics are frequently approximated with function approximators (such as...

master thesis 2022

document

Back to the Future: Solving Hidden Parameter MDPs with Hindsight

Ponnambalam, C.T. (author), Kamran, Danial (author), Simão, T. D. (author), Oliehoek, F.A. (author), Spaan, M.T.J. (author)

conference paper 2022

document

Lateral and Vertical Air Traffic Control Under Uncertainty Using Reinforcement Learning

Badea, C. (author), Groot, D.J. (author), Morfin Veytia, A. (author), Ribeiro, M.J. (author), Dalmau, Ramon (author), Ellerbroek, Joost (author), Hoekstra, J.M. (author)

Air traffic demand has increased at an unprecedented rate in the last decade (albeit interrupted by the COVID pandemic), but capacity has not increased at the same rate. Higher levels of automation and the implementation of decision-support tools for air traffic controllers could help increase capacity and catch up with demand. The air traffic...

conference paper 2022

document

Event-Based Communication in Distributed Q-Learning

Jarne Ornia, D. (author), Mazo, M. (author)

We present an approach to reduce the communication of information needed on a Distributed Q-Learning system inspired by Event Triggered Control (ETC) techniques. We consider a baseline scenario of a Distributed Q-Learning problem on a Markov Decision Process (MDP). Following an event-based approach, N agents sharing a value function explore the...

conference paper 2022

document

Optimal dispatch of PV inverters in unbalanced distribution systems using Reinforcement Learning

Vergara Barrios, P.P. (author), Salazar, Mauricio (author), Giraldo, Juan S. (author), Palensky, P. (author)

In this paper, a Reinforcement Learning (RL)-based approach to optimally dispatch PV inverters in unbalanced distribution systems is presented. The proposed approach exploits a decentralized architecture in which PV inverters are operated by agents that perform all computational processes locally; while communicating with a central agent to...

journal article 2022

document

Robust Event-Driven Interactions in Cooperative Multi-agent Learning

Jarne Ornia, D. (author), Mazo, M. (author)

We present an approach to safely reduce the communication required between agents in a Multi-Agent Reinforcement Learning system by exploiting the inherent robustness of the underlying Markov Decision Process. We compute robustness certificate functions (off-line), that give agents a conservative indication of how far their state measurements...

conference paper 2022

document

From Supervised to Reinforcement Learning: an Inverse Optimization Approach

Dimanidis, Ioannis (author)

We propose a novel method combining elements of supervised- and Q-learning for the control of dynamical systems subject to unknown disturbances. By using the Inverse Optimization framework and in-hindsight information we can derive a causal parametric optimization policy that approximates a non-causal MPC expert. Furthermore, we propose a new...

master thesis 2021

document

Simulating an Scheduling Airport Security Checkpoints: Q-Learning-Based Allocation of Operators to Security Teams at an Airport Security Checkpoint

Köstler, Klemens (author)

In this paper, we propose and analyze a q-learning-based approach for allocation of operators to security teams in order to improve operational efficiency of an airport security checkpoint. The research is composed of two parts. First, we develop an agent-based model capable of simulating an airport security checkpoint. Second, we introduce...

master thesis 2021

document

Reinforcement Learning for Quadcopter Fault Tolerance: Analyzing Hierarchical and Online Deep Reinforcement Learning for Quadcopter Fault Tolerant Control

Bello, Riccardo (author)

The demand of adding fault tolerance to quadcopter control systems has significantly increased with the rise of adoption of UAVs in numerous sectors. This work proposes and demonstrates the use of Hierarchical Reinforcement Learning to control a quadcopter subject to severe actuator fault. State-of-the-art algorithms are implemented, and a...

master thesis 2021

document

Know what it does not know: Improving Offline Deep Reinforcement Learning with Uncertainty Estimation

Smit, Jordi (author)

Offline reinforcement learning, or learning from a fixed data set, is an attractive alternative to online reinforcement learning. Offline reinforcement learning promises to address the cost and safety implications of taking numerous random or bad actions online, which is a crucial aspect of traditional reinforcement learning that makes it...

master thesis 2021

document

Reinforcement learning for order distribution in self-organizing logistics

de Vries, Yorick (author)

With the increasing global demand for logistics, supply chains have grown a lot in volume over the last decades. To be able to operate effectively within the capacity constraints of the carriers, proper collaboration and optimization of order allocation is required. Van Berkel Logistics facilitates the transport of containers by trucks from sea...

master thesis 2021

document

End-to-End Hierarchical Reinforcement Learning for Adaptive Flight Control: A method for model-independent control through Proximal Policy Optimization with learned Options

Ge, Zhouxin (author)

Aircraft with disruptive designs have no high-fidelity and accurate flight models. At the same time, developing models for stochastic phenomena for traditional aircraft configurations are costly, and classical control methods cannot operate beyond the predefined operation points or adapt to unexpected changes to the aircraft. The Proximal Policy...

master thesis 2021

document

Bus management using multi-agent reinforcement learning

Weijs, George (author)

Bus bunching is a problem that occurs in many high frequent bus systems. This can be averted by several countermeasures of which holding control is the most popular one in practice. Holding control strategies are often implemented using predefined rules. In this study, multi-agent reinforcement learning is selected to develop an effective...

master thesis 2021

document

Using reinforcement learning for finding the optimal quantum annealing schedule

Dijksman, Calvin (author)

For NP-hard optimisation problems no polynomial-time algorithms exist for finding a solution. Therefore, heuristic methods are often used, especially when approximate solutions can be satisfactory. One such method is quantum annealing, a method where some initial Hamiltonian is slowly perturbed to anneal towards a problem Hamiltonian. The...

bachelor thesis 2021

document

Log-Based Behavioral System Model Inference Using Reinforcement Learning

Symeonidis, Pandelis (author)

System behavior models are highly useful for the developers of the system as they aid in system comprehension, documentation, and testing. Even though methods to obtain such models exist, e.g. profiling, tracing, source code inference and existing log-based inference methods, they can not successfully be applied to the case of large, real-time...

bachelor thesis 2021

document

Music Playlist Generation for Emotion Regulation: A Functional Component in the Individualized Music Intervention for Persons with Dementia

Kreynen, Bernd (author)

Dementia care is a growing problem, both due to a rising number of cases and due to a shortage in healthcare workers. Aside from cognitive symptoms persons with dementia (PwDs) often deal with psychological symptoms such as agitation. The individualized music intervention (IMI) by Linda Gerdner has been proposed to reduce these. This is the...

master thesis 2021

Pages

Pages