Searched for: subject%3A%22Reinforcement%255C%2BLearning%255C%2B%255C%2528RL%255C%2529%22
(21 - 32 of 32)
Pages
- 1
- 2
- document
-
Ribeiro, M.J. (author), Ellerbroek, Joost (author), Hoekstra, J.M. (author)Current investigations into urban aerial mobility, as well as the continuing growth of global air transportation, have renewed interest in conflict detection and resolution (CD&R) methods. The use of drones for applications such as package delivery, would result in traffic densities that are orders of magnitude higher than those currently...journal article 2021
- document
-
Rijsdijk, Jorai (author)Side-channel attacks (SCA), which use unintended leakage to retrieve a secret cryptographic key, have become more sophisticated over time. With the recent successes of machine learning (ML) and especially deep learning (DL) techniques against cryptographic implementations even in the presence of dedicated countermeasures, various methods have...master thesis 2020
- document
-
van der Toorn, Eric (author)A recent advancement in Reinforcement Learning is the capability of modelling opponents. In this work, we are interested in going back to basics and testing this capability within the Iterated Prisoner's Dilemma, a simple method for modelling multi agent systems. Using the self modelling advantage actor critic model, we set up a single agent...bachelor thesis 2020
- document
-
Fris, Rein (author)Deep Reinforcement Learning (DRL) enables us to design controllers for complex tasks with a deep learning approach. It allows us to design controllers that are otherwise cumbersome to design with conventional control methodologies. Often, an objective for RL is binary in nature. However, exploring in environments with sparse rewards is a problem...master thesis 2020
- document
-
Martens, Vera (author)A Taxi Dispatch Problem involves assigning taxis to requests of passengers who are waiting at different locations for a trip. In today's economy and society, the Taxi Dispatch Problem and other transport problems can be found everywhere. Not only in transporting people, but also in food delivery from restaurants and package delivery for all kind...bachelor thesis 2020
- document
-
Ribeiro, M.J. (author), Ellerbroek, J. (author), Hoekstra, J.M. (author)The use of drones for applications such as package delivery, in an urban setting, would result in traffic densities that are orders of magnitude higher than any observed in manned aviation. Current geometric resolution models have proven to be very efficient. However, at the extreme densities envisioned for such drone applications, performance...conference paper 2020
- document
-
Yuan, Haoran (author)The control of aircraft can be carried out by Reinforcement Learning agents; however, the difficulty of obtaining sufficient training samples often makes this approach infeasible. Demonstrations can be used to facilitate the learning process, yet algorithms such as Apprenticeship Learning generally fail to produce a policy that outperforms the...master thesis 2019
- document
-
van Dam, Geart (author)This research investigates and proposes a new method for obstacle detection and avoidance on quadrotors. One that does not require the addition of any sensors, but relies solely on measurements from the accelerometer and rotor controllers. The detection of obstacles is based on the principle that the airflow around a quadrotor changes when the...master thesis 2019
- document
-
Meulman, Erik (author)Model-based evolutionary algorithms (MBEAs) are praised for their broad applicability to black-box optimization problems. In practical applications however, they are mostly used to repeatedly optimize different instances of a single problem class, a setting in which specialized algorithms generally perform better. In this paper, we introduce the...master thesis 2019
- document
-
Kroezen, Dave (author)Online Reinforcement Learning is a possible solution for adaptive nonlinear flight control. In this research an Adaptive Critic Design (ACD) based on Dual Heuristic Dynamic Programming (DHP) is developed and implemented on a simulated Cessna Citation 550 aircraft. Using an online identified system model approximation, the method is independent...master thesis 2019
- document
-
Mannucci, T. (author), van Kampen, E. (author), de Visser, C.C. (author), Chu, Q. P. (author)Self-learning approaches, such as reinforcement learning, offer new possibilities for autonomous control of uncertain or time-varying systems. However, exploring an unknown environment under limited prediction capabilities is a challenge for a learning agent. If the environment is dangerous, free exploration can result in physical damage or in...journal article 2018
- document
-
Zafar, Shanza (author)To solve the problem of optimal control for nonlinear system, Actor Critic Designs (ACD) can be utilized which use the concept of Reinforcement learning (RL) and function approximators such as Neural networks (NN). Traditional ACD methods require a model NN that needs to be trained offline. Recently, research focus has been shifted to model-free...master thesis 2018
Searched for: subject%3A%22Reinforcement%255C%2BLearning%255C%2B%255C%2528RL%255C%2529%22
(21 - 32 of 32)
Pages
- 1
- 2