State-Action Control Barrier Functions

Imposing Safety on Learning-Based Control With Low Online Computational Costs

Journal Article (2026)
Author(s)

Kanghui He (TU Delft - Team Bart De Schutter)

Shengling Shi (Massachusetts Institute of Technology)

Ton Van Den Boom (TU Delft - Team Ton van den Boom)

Bart De Schutter (TU Delft - Delft Center for Systems and Control)

DOI related publication
https://doi.org/10.1109/TAC.2025.3636804 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Journal title
IEEE Transactions on Automatic Control
Issue number
5
Volume number
71
Pages (from-to)
3365-3371
Downloads counter
6
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning-based control with safety guarantees usually requires real-time safety certification and modifications of possibly unsafe learning-based policies. The control barrier function (CBF) method uses a safety filter (SF) containing a constrained optimization problem to produce safe policies. However, finding a valid CBF for a general nonlinear system requires a complex function parameterization, which in general makes the policy optimization problem difficult to solve in real time. For nonlinear systems with nonlinear state constraints, this paper proposes the novel concept of state-action CBFs (SACBFs), which do not only characterize the safety at each state but also evaluate the control inputs taken at each state. SACBFs, in contrast to CBFs, enable a flexible parameterization, resulting in a SF that involves a convex quadratic optimization problem, which significantly alleviates the online computational burden. We propose a learning-based approach to synthesize SACBFs. The effect of learning errors on the effectiveness of SACBFs is addressed by constraint tightening and introducing a new concept called contractive-set CBFs. This ensures formal safety guarantees for the learned CBFs and control policies. Simulation results on an inverted pendulum with elastic walls validate the proposed CBFs in terms of constraint satisfaction and CPU time.