Online Convex Optimization with Predictions

None, None

Online Convex Optimization with Predictions

Static and Dynamic Environments

Master Thesis (2020)

Author(s)

P. Zattoni Scroccaro (TU Delft - Mechanical Engineering)

Contributor(s)

Peyman Mohajerinesfahani – Mentor (TU Delft - Team Tamas Keviczky)

S. Grammatico – Graduation committee member (TU Delft - Team Bart De Schutter)

B. Atasoy – Graduation committee member (TU Delft - Transport Engineering and Logistics)

Arman Sharifi K. – Graduation committee member (TU Delft - Team Tamas Keviczky)

Faculty

Mechanical Engineering

Copyright

Prediction Online Convex Optimization Online Learning

To reference this document use:

https://resolver.tudelft.nl/uuid:ce13b0da-fb0a-4e9f-b5a4-ef9b0dadf29b

More Info

expand_more

Publication Year

2020

Language

English

Copyright

Graduation Date

21-09-2020

Awarding Institution

Delft University of Technology

Programme

['Mechanical Engineering | Systems and Control']

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this thesis, we study Online Convex Optimization algorithms that exploit predictive and/or dynamical information about a problem instance. These features are inspired by recent developments in the Online Mirror Decent literature. When the Player's performance is compared with the best fixed decision in hindsight, we show that it is possible to achieve constant regret bounds under perfect gradient predictions and optimal minimax bounds in the worst-case, generalizing previous results from the literature. For dynamic environments, we propose a new algorithm, and show that it achieves dynamic regret bounds that exploit both gradient predictions and knowledge about the dynamics of the action sequence that the Player's performance is being compared with. We present results for both convex and strongly convex costs. Finally, we provide numerical experiments that corroborate our theoretical results.

Files

Master_thesis_Pedro_Zattoni_Sc... (pdf)

(pdf | 2.21 Mb)

License info not available