Print Email Facebook Twitter Deep Reinforcement Learning for the Automatic Six-Degree-of-Freedom Docking Maneuver of Space Vehicles Title Deep Reinforcement Learning for the Automatic Six-Degree-of-Freedom Docking Maneuver of Space Vehicles Author Casals Sadlier, Juliette (TU Delft Aerospace Engineering) Contributor van Kampen, E. (mentor) Degree granting institution Delft University of Technology Programme Aerospace Engineering Date 2022-10-31 Abstract The implementation of a model-free, off-policy, actor-critic deep reinforcement learning algorithm consistent of two separate agents to a six-degree-of freedom spacecraft docking maneuver to develop a control policy is carried out in the research presented in this article. Reinforcement learning has the ability to learn without instruction, this aspect provides a potential framework for autonomous docking maneuvers in uncertain environments with low on-board computational cost. A Twin-Delayed Deep Deterministic Policy Gradient algorithm consistent of two agents is used to synthesise the docking control policy valid for the six degree-of-freedom continuous state-space. Testing of the resultant policy exhibits its behaviour and capability to achieve successful docking within the established position and attitude ranges. Subject Deep Reinforcement LearningReinforcement Learning (RL)SpacecraftControl systemdocking To reference this document use: http://resolver.tudelft.nl/uuid:87a148b9-163b-41d7-aea8-479299be113c Part of collection Student theses Document type master thesis Rights © 2022 Juliette Casals Sadlier Files PDF Juliette_DRL_thesis_repor ... _final.pdf 11.52 MB Close viewer /islandora/object/uuid:87a148b9-163b-41d7-aea8-479299be113c/datastream/OBJ/view