Constrained Online Convex Optimization with Memory and Predictions

Journal Article (2026)
Author(s)

Mohammed Abdullah (Telecom SudParis, Université Paris Saclay)

George Iosifidis (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Salah Eddine Elayoubi (Université Paris Saclay)

Tijani Chahed (Telecom SudParis)

Research Group
Networked Systems
DOI related publication
https://doi.org/10.1609/aaai.v40i24.39031 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Research Group
Networked Systems
Journal title
Proceedings of the AAAI Conference on Artificial Intelligence
Issue number
24
Volume number
40
Pages (from-to)
19524-19532
Event
40th AAAI Conference on Artificial Intelligence, AAAI 2026 (2026-01-20 - 2026-01-27), Singapore, Singapore
Downloads counter
25
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We study Constrained Online Convex Optimization with Memory (COCO-M), where both the loss and the constraints depend on a finite window of past decisions made by the learner. This setting extends the previously studied unconstrained online optimization with memory framework and captures practical problems such as the control of constrained dynamical systems and scheduling with reconfiguration budgets. For this problem, we propose the first algorithms that achieve sublinear regret and sublinear cumulative constraint violation under time-varying constraints, both with and without predictions of future loss and constraint functions. Without predictions, we introduce an adaptive penalty approach that guarantees sublinear regret and constraint violation. When short-horizon and potentially unreliable predictions are available, we reinterpret the problem as online learning with delayed feedback and design an optimistic algorithm whose performance improves as prediction accuracy improves, while remaining robust when predictions are inaccurate. Our results bridge the gap between classical constrained online convex optimization and memory-dependent settings, and provide a versatile learning toolbox with diverse applications.

Files

Taverne
warning

File under embargo until 14-09-2026