Search results | TU Delft Repositories

document

Multi-Agent Actor-Critic Reinforcement Learning for Cooperative Tasks

Bayiz, Y.E. (author)

For single-agent problems, Reinforcement Learning (RL) algorithms proved to be useful learning optimal control laws for nonlinear dynamic systems without relying on a mathematical model of the system to be controlled. With their ability to work on continuous action and state spaces, actor-critic RL algorithms are especially advantageous in that...

master thesis 2014

document

Reward system design for incorporating control performance

Nagaki, K. (author)

Reinforcement learning (RL) is a machine learning technique whereby the controller learns the control law by optimizing the received cumulative amount of reward. A reward is an instantaneous evaluation of the applied action at the current state, given by reward function. However in theory the reward function is assumed to be given, in practice...

master thesis 2015

document

Control by Interconnection using Reinforcement Learning

Bhattacharjee, A. (author)

The dynamics of many physical processes can be described by port-Hamiltonian (PH) models where the importance of the energy function can be seen. In Control by Interconnection (CbI), the controller is another PH system that is connected to the plant through a power preserving interconnection to add up the energy functions. However, a major issue...

master thesis 2015

document

Deep Reinforcement Learning - Pretraining actor-critic networks using state representation learning

Munk, J. (author)

In control, the objective is to find a mapping from states to actions that steer a system to a desired reference. A controller can be designed by an engineer, typically using some model of the system or it can be learned by an algorithm. Reinforcement Learning (RL) is one such algorithm. In RL, the controller is an agent that interacts with the...

master thesis 2016

document

Automatic synthesis of supervisory control systems

Najafi, E. (author)

Sequential composition is an effective supervisory control method for addressing control problems in nonlinear dynamical systems. It executes a set of controllers sequentially to achieve a control specification that cannot be realized by a single controller. Sequential composition focuses on the interaction between a collection of pre-designed...

doctoral thesis 2016

document

Use of Affordances for Efficient Robot Learning

Wang, C. (author)

doctoral thesis 2017

document

Intelligent Flapping Wing Control: Reinforcement Learning for the DelFly

Goedhart, Menno (author)

Flight control of the DelFly is challenging, because of its complex dynamics and variability due to manufacturing inconsistencies. Machine Learning algorithms can be used to tackle these challenges. A Policy Gradient algorithm is used to tune the gains of a Proportional-Integral controller using Reinforcement Learning. Furthermore, a novel...

master thesis 2017

document

Deep Reinforcement Learning for Bipedal Robots

Rastogi, Divyam (author)

Reinforcement Learning (RL) is a general purpose framework for designing controllers for non-linear systems. It tries to learn a controller (policy) by trial and error. This makes it highly suitable for systems which are difficult to control using conventional control methodologies, such as walking robots. Traditionally, RL has only been...

master thesis 2017

document

Potential Field Methods for Safe Reinforcement Learning: Exploring Q-Learning and Potential Fields

Bhowal, Abhranil (author)

A Reinforcement Learning (RL) agent learns about its environment through exploration. For most physical applications such as search and rescue UAVs, this exploration must take place with safety in mind. Unregulated exploration, especially at the beginning of a run, will lead to fatal situations such as crashes. One approach to mitigating these...

master thesis 2017

document

Reinforcement Learning Across Timescales

Ravi, Siddharth (author)

This project addresses a fundamental problem faced by many reinforcement learning agents. Commonly used reinforcement learning agents can be seen to have deteriorating performances at increasing frequencies, as they are unable to correctly learn the ordering of expected returns for actions that are applied. We call this the disappearing...

master thesis 2017

document

Adaptive-Critic Designs For Aircraft Control: A comparative study between Heuristic Dynamic Programming and Dual Heuristic Programming

Pohl, Franz (author)

The Variable Camber Continuous Trailing Edge Flap (VCCTEF) is a novel aircraft control system that intents to prevent undesired aeroelastic deflections by precise lift tailoring along the wing span. However, the unknown dynamics and increased complexity of the new hardware imposes difficulties to establish an optimal controller. One approach is...

master thesis 2017

document

Safe Online Robust Exploration for Reinforcement Learning Control of Unmanned Aerial Vehicles

Mannucci, T. (author)

doctoral thesis 2017

document

Directed Increment Policy Search for Behavior Tree Task Performance Optimization: Crossing the Reality Gap

Leest, Steven (author)

Robotic behavior policies learned in simulation suffer from a performance degradation once transferred to a real-world robotic platform. This performance degradation originates from discrepancies between the real-world and simulation environment, referred to as the reality gap. To cross the reality gap, this papers presents a simple...

master thesis 2017

document

Kinematic Synthesis using Reinforcement Learning

Vermeer, Kaz (author)

Advanced tools such as machine learning are slowly finding their way into the modern scientist’s toolbox . In the design of mechanical systems however hardly any machine learning applications are being used. Research into the viability of such an application is therefore necessary.<br/>We have performed such research, using a specific type of...

master thesis 2017

document

Smart Start: A Directed and Persistent Exploration Framework for Reinforcement Learning

Keulen, Bart (author)

An important problem in reinforcement learning is the exploration-exploitation dilemma. Especially for environments with sparse or misleading rewards it has proven difficult to construct a good exploration strategy. For discrete domains good exploration strategies have been devised, but are often nontrivial to implement on more complex domains...

master thesis 2018

document

Online reinforcement learning control for aerospace systems

Zhou, Y. (author)

Reinforcement Learning (RL) methods are relatively new in the field of aerospace guidance, navigation, and control. This dissertation aims to exploit RL methods to improve the autonomy and online learning of aerospace systems with respect to the a priori unknown system and environment, dynamical uncertainties, and partial observability. In the...

doctoral thesis 2018

document

Vision-Based Reinforcement Learning for the guidance of an AR Drone 2

Siddiquee, Manan (author)

Reinforcement Learning (RL) has been applied to teach quadcopters guidance tasks. Most applications rely on position information from an absolute reference<br/>system such as Global Positioning System (GPS). The dependence on "absolute<br/>position" information is a general limitation in the autonomous flight of Unmanned Aerial Vehicles (UAVs)....

master thesis 2018

document

Limit order placement optimization with Deep Reinforcement Learning: Learning from patterns in cryptocurrency market data

Juchli, Marc (author)

For various reasons, financial institutions often make use of high-level trading strategies when buying and selling assets. Many individuals, irrespective or their level of prior trading knowledge, have recently entered the field of trading due to the increasing popularity of cryptocurrencies, which offer a low entry barrier for trading....

master thesis 2018

document

Robotic Auxiliary Losses for continuous reinforcement learning

Cherici, Teo (author)

Recent advancements in computation power and artificial intelligence have allowed the creation of advanced reinforcement learning models which could revolutionize, between others, the field of robotics. As model and environment complexity increase, however, training solely through the feedback of environment reward becomes more difficult. From...

master thesis 2018

document

Action Selection Policies for Walking Monte Carlo Tree Search

Starre, Rolf (author)

Recent Reinforcement Learning methods have combined function approximation and Monte Carlo Tree Search and are able to learn by self-play up to a very high level in several games such as Go and Hex. One aspect in this combination<br/>that has not had a lot of attention is the action selection policy during self-play, which could influence the...

master thesis 2018

Pages

Pages