Early Stopping Bayesian Optimization for Controller Tuning

Conference Paper (2025)
Author(s)

David Stenger (RWTH Aachen University)

Dominik Scheurenberg (RWTH Aachen University)

H. Vallery (TU Delft - Biomechatronics & Human-Machine Control, RWTH Aachen University, Erasmus MC)

Sebastian Trimpe (RWTH Aachen University)

Research Group
Biomechatronics & Human-Machine Control
DOI related publication
https://doi.org/10.1109/CDC56724.2024.10886051
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Biomechatronics & Human-Machine Control
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Pages (from-to)
3747-3753
ISBN (electronic)
979-8-3503-1633-9
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Manual tuning of performance-critical controller parameters can be tedious and sub-optimal. Bayesian Optimization (BO) is an increasingly popular practical alternative to automatically optimize controller parameters from few experiments. Standard BO practice is to evaluate the closed-loop performance of parameters proposed during optimization on an episode with a fixed length. However, fixed-length episodes can be wasteful. For example, continuing an episode where already the start shows undesirable behavior such as strong oscillations seems pointless. Therefore, we propose a BO method that stops an episode early if suboptimality becomes apparent before an episode is completed. Such early stopping results in partial observations of the controller’s performance, which cannot directly be included in standard BO. We propose three heuristics to facilitate partially observed episodes in BO. Through five numerical and one hardware experiment, we demonstrate that early stopping BO can substantially reduce the time needed for optimization.

Files

License info not available
warning

File under embargo until 26-08-2025