Print Email Facebook Twitter Longitudinal Control for Autonomous Vehicles Title Longitudinal Control for Autonomous Vehicles: A comparison between Reinforcement Learning and Optimal Control Author Faassen, Gilles (TU Delft Mechanical Engineering) Contributor Puccetti, Luca (mentor) Alirezaei, M. (mentor) Hellendoorn, J. (mentor) Ferrari, Riccardo M.G. (mentor) Degree granting institution Delft University of Technology Programme Mechanical Engineering | Systems and Control Date 2019-03-29 Abstract In the automotive industry automation is popular and every year car OEMs advance their technology to be able to drive autonomously. Longitudinal control of the vehicles is an important part of the complete autonomous driving system. The difficulty of this control problem lies with changing longitudinal dynamics and the lack of full-state system information. This complicates controller design when using classic model-based approaches such as Optimal Control (OC). Currently the controllers are still manually tuned by control engineers in thevehicle. This is time consuming and expensive, therefore other methods for controller design such as learning are explored. Reinforcement Learning (RL) is one of those methods. To examine the potential benefits of learning a controller, this work will make a comparison between RL and OC. For RL, an actor-critic structure using deterministic policy gradient is applied. Due to partially observable system dynamics OC is used as an optimal output feedback controller. The comparison complies speed control of an autonomous vehicle. The RL agent will learn a controller by training on a nonlinear high fidelity vehicle model. In this work it was demonstrated that RL can reach the same performance as OC when all environmental settings are comparable. When environmental settings deviate, it was is found that RL outperforms OC. To verify the simulated results all controllers were confirmed in an experimental real-life setting.In conclusion, this proved a promising benefit of learning with respect to classical controller computation, when dealing with partially available system information. Subject reinforcement learningoptimal controlautonomous driving To reference this document use: http://resolver.tudelft.nl/uuid:8ab1fc10-d30e-4f9d-8003-fbc40d675c13 Embargo date 2024-03-29 Part of collection Student theses Document type master thesis Rights © 2019 Gilles Faassen Files PDF mscThesis_GF.pdf 8.17 MB Close viewer /islandora/object/uuid:8ab1fc10-d30e-4f9d-8003-fbc40d675c13/datastream/OBJ/view