Conjugate Dynamic Programming

None, None

Conjugate Dynamic Programming

Master Thesis (2021)

Author(s)

C. Rodopoulos (TU Delft - Mechanical Engineering)

Contributor(s)

P. Mohajerin Esfahani – Mentor (TU Delft - Team Bart De Schutter)

Mohamad Amin Sharifi Kolarijani – Graduation committee member (TU Delft - Team Peyman Mohajerin Esfahani)

G.F. Max – Graduation committee member (TU Delft - Team Peyman Mohajerin Esfahani)

Azita Dabiri – Coach (TU Delft - Team Azita Dabiri)

Faculty

Mechanical Engineering

Copyright

Dynamic Programming Approximate Dynamic Programming Legendre Transform

To reference this document use:

https://resolver.tudelft.nl/uuid:04b74b05-6cad-4a40-9c96-5d9881e7e8ce

More Info

expand_more

Publication Year

2021

Language

English

Copyright

Graduation Date

06-12-2021

Awarding Institution

Delft University of Technology

Programme

['Mechanical Engineering | Systems and Control']

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In decision making problems, the ability to compute the optimal solution can pose a serious challenge. Dynamic Programming (DP) aims to provide a framework to deal with a category of such problems, namely ones that involve sequential decision making. By dividing the original control problem into sub-problems and solving it backwards in time, from the end of the time horizon to the start, the method can compute a map of optimal solutions with respect to the initial condition. In order to divide the original problem into subproblems the DP method takes advantage of the principle of optimality, which states that a sub-solution of the optimal solution should be the optimal solution for the equivalent subproblem. In control systems, where the state and decision spaces are continuous, the original DP framework can be intractable due to the size of the discretization needed to simulate the continuous space. Therefore, efficient approximations are needed to solve such problems. One promising method is called Conjugate Dynamic Programming (CDP). The CDP algorithm is able to transform the original framework and solve problems in the conjugate domain providing a computational advantage over the standard method. In this work, we aim to improve and extend the setting under which the CDP algorithm operates, thus providing a more concrete advantage over standard method . In that regard, we will extract the optimal control actions from within the CDP algorithm, eliminating the need for solving an extra optimization problem for their computation. In addition, we will introduce a different interpolation technique that can outperform the current one, in certain scenarios, thus granting the user more choice when solving a decision making problem.

Files

MSc_Thesis_Report_Rodopoulos_C... (pdf)

(pdf | 1.06 Mb)

License info not available