Constrained Online Convex Optimization with Memory and Predictions

None, None; None, None; None, None; None, None

Constrained Online Convex Optimization with Memory and Predictions

Journal Article (2026)

Author(s)

Mohammed Abdullah (Telecom SudParis, Université Paris Saclay)

George Iosifidis (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Salah Eddine Elayoubi (Université Paris Saclay)

Tijani Chahed (Telecom SudParis)

Research Group

Networked Systems

DOI related publication

https://doi.org/10.1609/aaai.v40i24.39031 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:309f69df-5618-45bf-8082-8e807c9fa104

More Info

expand_more

Publication Year

2026

Language

English

Research Group

Networked Systems

Journal title

Proceedings of the AAAI Conference on Artificial Intelligence

Issue number

24

Volume number

40

Pages (from-to)

19524-19532

Event

40th AAAI Conference on Artificial Intelligence, AAAI 2026 (2026-01-20 - 2026-01-27), Singapore, Singapore

Downloads counter

25

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We study Constrained Online Convex Optimization with Memory (COCO-M), where both the loss and the constraints depend on a finite window of past decisions made by the learner. This setting extends the previously studied unconstrained online optimization with memory framework and captures practical problems such as the control of constrained dynamical systems and scheduling with reconfiguration budgets. For this problem, we propose the first algorithms that achieve sublinear regret and sublinear cumulative constraint violation under time-varying constraints, both with and without predictions of future loss and constraint functions. Without predictions, we introduce an adaptive penalty approach that guarantees sublinear regret and constraint violation. When short-horizon and potentially unreliable predictions are available, we reinterpret the problem as online learning with delayed feedback and design an optimistic algorithm whose performance improves as prediction accuracy improves, while remaining robust when predictions are inaccurate. Our results bridge the gap between classical constrained online convex optimization and memory-dependent settings, and provide a versatile learning toolbox with diverse applications.

Files

29763-AAAI26.AbdullahM-ML.pdf

(pdf | 0.623 Mb)

Taverne

File under embargo until 14-09-2026