Javier Ruiz-del-Solar | TU Delft Repository

Interactive Learning of Temporal Features for Control

Shaping Policies and State Representations From Human Feedback

Journal article (2020) - Rodrigo Pérez-Dattari (author), Carlos Celemin (author), G. Franzese (author), Javier Ruiz-del-Solar (author), J. Kober (author)

The ongoing industrial revolution demands more flexible products, including robots in household environments and medium-scale factories. Such robots should be able to adapt to new conditions and environments and be programmed with ease. As an example, let us suppose that there ...

Interactive Learning with Corrective Feedback for Policies Based on Deep Neural Networks

Conference paper (2020) - Rodrigo Pérez-Dattari (author), Carlos Celemin (author), Javier Ruiz-del-Solar (author), J. Kober (author)

Deep Reinforcement Learning (DRL) has become a powerful strategy to solve complex decision-making problems based on Deep Neural Networks (DNNs). However, it is highly data demanding, and therefore infeasible in physical systems for most applications. In this work, we approach an alternative ...

Continuous control for high-dimensional state spaces

An interactive learning approach

Conference paper (2019) - Rodrigo Pérez-Dattari (author), Carlos Celemin (author), Javier Ruiz-del-Solar (author), J. Kober (author)

Deep Reinforcement Learning (DRL) has become a powerful methodology to solve complex decision-making problems. However, DRL has several limitations when used in real-world problems (e.g., robotics applications). For instance, long training times are required and cannot be acceler ...

Reinforcement learning of motor skills using Policy Search and human corrective advice

Journal article (2019) - Carlos Celemin (author), Guilherme Maeda (author), Javier Ruiz-del-Solar (author), Jan Peters (author), Jens Kober (author)

Robot learning problems are limited by physical constraints, which make learning successful policies for complex motor skills on real systems unfeasible. Some reinforcement learning methods, like Policy Search, offer stable convergence toward locally optimal solutions, whereas in ...

A fast hybrid reinforcement learning framework with human corrective feedback

Journal article (2018) - Carlos Celemin (author), Javier Ruiz-del-Solar (author), J. Kober (author)

Reinforcement Learning agents can be supported by feedback from human teachers in the learning loop that guides the learning process. In this work we propose two hybrid strategies of Policy Search Reinforcement Learning and Interactive Machine Learning that benefit from both sour ...

Decentralized Reinforcement Learning of robot behaviors

Journal article (2018) - David L. Leottau (author), Javier Ruiz-del-Solar (author), Robert Babuska (author)

A multi-agent methodology is proposed for Decentralized Reinforcement Learning (DRL) of individual behaviors in problems where multi-dimensional action spaces are involved. When using this methodology, sub-tasks are learned in parallel by individual agents working toward a common ...

Decentralized reinforcement learning applied to mobile robots

Conference paper (2017) - David L. Leottau (author), Aashish Vatsyayan (author), Javier Ruiz-del-Solar (author), Robert Babuska (author)

In this paper, decentralized reinforcement learning is applied to a control problem with a multidimensional action space. We propose a decentralized reinforcement learning architecture for a mobile robot, where the individual components of the commanded velocity vector are learne ...

Human corrective advice in the policy search loop

Abstract (2017) - Carlos Celemin (author), Guilherme Maeda (author), J. Kober (author), Javier Ruiz-del-Solar (author)

Machine Learning methods applied to decision-making problems with real robots usually suffer from slow convergence due to the dimensionality of the search and difficulties in the reward design. Interactive Machine Learning (IML) or Learning from Demonstrations (LfD) methods are u ...