Data-driven construction of symbolic process models for reinforcement learning

None, None; None, None; None, None

Data-driven construction of symbolic process models for reinforcement learning

Conference Paper (2018)

Author(s)

Erik Derner (Czech Technical University)

Jiri Kubalik (Czech Technical University)

R. Babuška (Czech Technical University, TU Delft - Learning & Autonomous Control)

Research Group

Learning & Autonomous Control

DOI related publication

https://doi.org/10.1109/ICRA.2018.8461182

Reinforcement learning Optimal control Symbolic regression Model learning for control AI-based methods

To reference this document use:

https://resolver.tudelft.nl/uuid:1516a607-f908-49bb-9aa5-a3d7e8e5ce9a

More Info

expand_more

Publication Year

2018

Language

English

Research Group

Learning & Autonomous Control

Pages (from-to)

5105-5112

ISBN (electronic)

978-1-5386-3081-5

Abstract

Reinforcement learning (RL) is a suitable approach for controlling systems with unknown or time-varying dynamics. RL in principle does not require a model of the system, but before it learns an acceptable policy, it needs many unsuccessful trials, which real robots usually cannot withstand. It is well known that RL can be sped up and made safer by using models learned online. In this paper, we propose to use symbolic regression to construct compact, parsimonious models described by analytic equations, which are suitable for realtime robot control. Single node genetic programming (SNGP) is employed as a tool to automatically search for equations fitting the available data. We demonstrate the approach on two benchmark examples: a simulated mobile robot and the pendulum swing-up problem; the latter both in simulations and real-time experiments. The results show that through this approach we can find accurate models even for small batches of training data. Based on the symbolic model found, RL can control the system well

No files available

Metadata only record. There are no files for this record.