State-Action Control Barrier Functions

None, None; None, None; None, None; None, None

State-Action Control Barrier Functions

Imposing Safety on Learning-Based Control With Low Online Computational Costs

Journal Article (2026)

Author(s)

Kanghui He (TU Delft - Team Bart De Schutter)

Shengling Shi (Massachusetts Institute of Technology)

Ton Van Den Boom (TU Delft - Team Ton van den Boom)

Bart De Schutter (TU Delft - Delft Center for Systems and Control)

Machine learning Nonlinear control Control barrier functions Constrained control

DOI related publication

https://doi.org/10.1109/TAC.2025.3636804 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:5e9bddcc-e47f-4bdd-b933-ea50b45bea50

More Info

expand_more

Publication Year

2026

Language

English

Journal title

IEEE Transactions on Automatic Control

Issue number

5

Volume number

71

Pages (from-to)

3365-3371

Downloads counter

6

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning-based control with safety guarantees usually requires real-time safety certification and modifications of possibly unsafe learning-based policies. The control barrier function (CBF) method uses a safety filter (SF) containing a constrained optimization problem to produce safe policies. However, finding a valid CBF for a general nonlinear system requires a complex function parameterization, which in general makes the policy optimization problem difficult to solve in real time. For nonlinear systems with nonlinear state constraints, this paper proposes the novel concept of state-action CBFs (SACBFs), which do not only characterize the safety at each state but also evaluate the control inputs taken at each state. SACBFs, in contrast to CBFs, enable a flexible parameterization, resulting in a SF that involves a convex quadratic optimization problem, which significantly alleviates the online computational burden. We propose a learning-based approach to synthesize SACBFs. The effect of learning errors on the effectiveness of SACBFs is addressed by constraint tightening and introducing a new concept called contractive-set CBFs. This ensures formal safety guarantees for the learned CBFs and control policies. Simulation results on an inverted pendulum with elastic walls validate the proposed CBFs in terms of constraint satisfaction and CPU time.

Files

State-Action_Control_Barrier_F... (pdf)

(pdf | 0.615 Mb)