Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

None, None; None, None; None, None; None, None

Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

Journal Article (2018)

Author(s)

Koppány Máthé (Technical University of Cluj-Napoca)

Lucian Busoniu (Technical University of Cluj-Napoca)

R Munos (Google DeepMind)

Bart De De Schutter (TU Delft - Team Bart De Schutter)

Research Group

Team Bart De Schutter

DOI related publication

https://doi.org/10.1016/j.engappai.2017.08.020

Planning Nonlinear predictive control Optimal control Near-optimality analysis

To reference this document use:

https://resolver.tudelft.nl/uuid:ad17f525-8ea2-4507-90c2-0a085530a73a

More Info

expand_more

Publication Year

2018

Language

English

Research Group

Team Bart De Schutter

Volume number

67

Pages (from-to)

355-367

Abstract

We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning is difficult, we introduce an adaptive variant called OASP that automatically adjusts so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor.

No files available

Metadata only record. There are no files for this record.