Lazy Lagrangians for Optimistic Learning With Budget Constraints
Daron Anderson (Trinity College Dublin)
George Iosifidis (TU Delft - Embedded Systems)
Douglas Leith (Trinity College Dublin)
Abstract
We consider the general problem of online convex optimization with time-varying budget constraints in the presence of predictions for the next cost and constraint functions, a setting that arises in a plethora of network resource management problems. A novel saddle-point algorithm is designed by combining a Follow-The-Regularized-Leader (FTRL) iteration with prediction-adaptive dynamic steps. The algorithm achieves O(T^{3β/4}) regret and O(T^{(1+β)/2}) constraint-violation bounds that are tunable via the parameter β ∈ [1/2, 1) and have constant factors that shrink with the quality of the predictions, eventually achieving O(1) regret for perfect predictions. Our work extends the seminal FTRL framework to this new OCO setting and outperforms the respective state-of-the-art greedy-based solutions, which naturally cannot benefit from predictions. It does so without imposing conditions on the (unknown) quality of the predictions, the cost functions, or the geometry of the constraints, beyond convexity.
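To make the abstract's description concrete, the following is a minimal sketch of an optimistic FTRL saddle-point loop on a toy one-dimensional instance. The problem instance (quadratic costs f_t(x) = (x − a_t)², linear budget constraints g_t(x) = x − b_t on X = [0, 1]), the step rules, and all variable names are illustrative assumptions, not the paper's exact algorithm or parameters; the sketch only shows the general pattern of a lazy primal FTRL step with a prediction hint plus a dual ascent step on the constraint.

```python
import numpy as np

def project(x, lo=0.0, hi=1.0):
    """Clip the iterate back into the feasible interval X = [lo, hi]."""
    return min(max(x, lo), hi)

# Toy instance (all values are assumptions for illustration only).
T = 200
rng = np.random.default_rng(0)
a = rng.uniform(0.2, 0.8, T)   # per-round cost minimizers for f_t(x) = (x - a_t)^2
b = 0.5 * np.ones(T)           # per-round budget levels for g_t(x) = x - b_t

x, lam = 0.5, 0.0              # primal iterate and dual (Lagrange) multiplier
grad_sum = 0.0                 # accumulated Lagrangian gradients (lazy FTRL state)
hint = 0.0                     # optimistic prediction of the next gradient
sigma = np.sqrt(T)             # quadratic-regularizer weight (assumed schedule)
total_cost, total_violation = 0.0, 0.0

for t in range(T):
    # Lazy FTRL primal step: minimize (grad_sum + hint) * x + sigma * x^2 over X.
    x = project(-(grad_sum + hint) / (2.0 * sigma))

    # Nature reveals the round-t cost and constraint at the played point.
    cost_grad = 2.0 * (x - a[t])      # gradient of f_t at x
    g = x - b[t]                      # constraint value g_t(x)

    # Gradient of the Lagrangian f_t(x) + lam * g_t(x); here dg/dx = 1.
    lag_grad = cost_grad + lam
    grad_sum += lag_grad
    hint = lag_grad                   # crude prediction: reuse the last gradient

    # Dual ascent on the budget constraint, kept nonnegative.
    lam = max(0.0, lam + g / np.sqrt(T))

    total_cost += (x - a[t]) ** 2
    total_violation += g

print(f"avg cost: {total_cost / T:.4f}, cumulative violation: {total_violation:.4f}")
```

Under perfect predictions the hint would equal the true upcoming gradient, shrinking the regret's leading constant, which is the effect the abstract's O(1) regret statement refers to.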