Searched for: +
(41 - 60 of 115)

Pages

document
Oren, Yaniv (author)
Deep, model based reinforcement learning has shown state of the art, human-exceeding performance in many challenging domains. <br/>Low sample efficiency and limited exploration remain however as leading obstacles in the field. <br/>In this work, we incorporate epistemic uncertainty into planning for better exploration.<br/>We develop a low-cost...
master thesis 2022
document
Völker, Willem (author)
Recent research on the Flying V - a flying-wing long-range passenger aircraft - shows that its airframe design is 25% more aerodynamically efficient than a conventional tube-and-wing airframe. The Flying V is therefore a promising contribution towards reduction in climate impact of long-haul flights. However, some design aspects of the Flying V...
master thesis 2022
document
de Haro Pizarroso, Gabriel (author)
Reinforcement Learning is being increasingly applied to flight control tasks, with the objective of developing truly autonomous flying vehicles able to traverse highly variable environments and adapt to unknown situations or possible failures. However, the development of these increasingly complex models and algorithms further reduces our...
master thesis 2022
document
Mija, Andrei (author)
Agents trained through single-agent reinforcement learning methods such as self-play can provide a good level of performance in multi-agent settings and even in fully cooperative environments. However, most of the time, training multiple agents together using single-agent self-play yields poor results as each agent tries to learn how to perform...
bachelor thesis 2022
document
Ordonez Cardenas, Nathan (author)
A longstanding problem in the area of reinforcement learning is human-agent col- laboration. As past research indicates that RL agents undergo a distributional shift when they start collaborating with human beings, the goal is to create agents that can adapt. We build upon research using the two-player Overcooked environment to repro- duce a...
bachelor thesis 2022
document
Pantea, Luca (author)
Recommender Systems play a significant part in filtering and efficiently prioritizing relevant information to alleviate the information overload problem and maximize user engagement. Traditional recommender systems employ a static approach towards learning the user's preferences, relying on logged previous interactions with the system,...
bachelor thesis 2022
document
Gaghi, Radu (author)
This paper introduces a strategy for learning opponent parameters in automated negotiation and using them for future negotiation sessions. The goal is to maximize the agent’s utility while being consistent in its performance over various negotiation scenarios. While a number of reinforcement learning approaches in the field have used Q-learning,...
bachelor thesis 2022
document
Agrawal, Arpit (author)
With the prospects of decentralized multi-agent systems becoming more prevalent in daily life, automated negotiation agents have made their place in these collaborative settings. They are an approach to promote communication between the agents in reaching solutions that are better for all involved.<br/><br/>Recent literature has shown great...
bachelor thesis 2022
document
GUO, Zhuoran (author)
Online searching for healthcare information has gradually become a widely used internet case. Suppose a patient suffers the symptom but is unsure of the action he needs to take, a self-diagnosis tool can help the patient identify the possible conditions and whether this patient needs to seek immediate medical help. However, the accuracy and...
master thesis 2022
document
Becker, Simion (author)
As the world is currently actively trying to reduce the consumption of fossil fuels, large investments are done in renewable energy sources and ways are sought after to electrify fossil fuel-intensive sectors. In line with these developments, the number of electric vehicles requiring access to the electric power grid has exploded putting...
master thesis 2022
document
de Vringer, Stefan (author)
Realistic vehicle routing problems have been highly relevant for years in a wide variety of domains. One such domain is food delivery, where well-crafted routes can reduce costs and contribute to customer satisfaction. This thesis formulates a problem variant for the restaurant meal delivery problem in order to examine the reoptimization of meal...
master thesis 2022
document
Remmerswaal, Willemijn (author)
Both model predictive control (MPC) and reinforcement learning (RL) have shown promising results in the control of traffic signals in urban traffic networks. There are, however, a few drawbacks. MPC controllers are not adaptive and therefore perform suboptimal in the presence of the uncertainties that always occur in urban traffic systems....
master thesis 2022
document
Foffano, Daniele (author)
Model-Based Reinforcement Learning (MBRL) algorithms solve sequential decision-making problems, usually formalised as Markov Decision Processes, using a model of the environment dynamics to compute the optimal policy. When dealing with complex environments, the environment dynamics are frequently approximated with function approximators (such as...
master thesis 2022
document
Dimanidis, Ioannis (author)
We propose a novel method combining elements of supervised- and Q-learning for the control of dynamical systems subject to unknown disturbances. By using the Inverse Optimization framework and in-hindsight information we can derive a causal parametric optimization policy that approximates a non-causal MPC expert. Furthermore, we propose a new...
master thesis 2021
document
Köstler, Klemens (author)
In this paper, we propose and analyze a q-learning-based approach for allocation of operators to security teams in order to improve operational efficiency of an airport security checkpoint. The research is composed of two parts. First, we develop an agent-based model capable of simulating an airport security checkpoint. Second, we introduce...
master thesis 2021
document
Bello, Riccardo (author)
The demand of adding fault tolerance to quadcopter control systems has significantly increased with the rise of adoption of UAVs in numerous sectors. This work proposes and demonstrates the use of Hierarchical Reinforcement Learning to control a quadcopter subject to severe actuator fault. State-of-the-art algorithms are implemented, and a...
master thesis 2021
document
Smit, Jordi (author)
Offline reinforcement learning, or learning from a fixed data set, is an attractive alternative to online reinforcement learning. Offline reinforcement learning promises to address the cost and safety implications of taking numerous random or bad actions online, which is a crucial aspect of traditional reinforcement learning that makes it...
master thesis 2021
document
de Vries, Yorick (author)
With the increasing global demand for logistics, supply chains have grown a lot in volume over the last decades. To be able to operate effectively within the capacity constraints of the carriers, proper collaboration and optimization of order allocation is required. Van Berkel Logistics facilitates the transport of containers by trucks from sea...
master thesis 2021
document
Ge, Zhouxin (author)
Aircraft with disruptive designs have no high-fidelity and accurate flight models. At the same time, developing models for stochastic phenomena for traditional aircraft configurations are costly, and classical control methods cannot operate beyond the predefined operation points or adapt to unexpected changes to the aircraft. The Proximal Policy...
master thesis 2021
document
Weijs, George (author)
Bus bunching is a problem that occurs in many high frequent bus systems. This can be averted by several countermeasures of which holding control is the most popular one in practice. Holding control strategies are often implemented using predefined rules. In this study, multi-agent reinforcement learning is selected to develop an effective...
master thesis 2021
Searched for: +
(41 - 60 of 115)

Pages