This thesis presents a simulation of a Martian lander using Reinforcement Learning. The objective is to train an agent to land on Mars using the Proximal Policy Optimization (PPO) method. The spacecraft is controlled by control allocation, where the translations and rotations are
...
This thesis presents a simulation of a Martian lander using Reinforcement Learning. The objective is to train an agent to land on Mars using the Proximal Policy Optimization (PPO) method. The spacecraft is controlled by control allocation, where the translations and rotations are controlled independently. Also a PD controller is made to control the same lander.
The PD controller is found to be more accurate in comparison with the Reinforcement learning controller. A reinforcement learning thruster model is also made. This spacecraft is controlled by five thrusters and the motions coupled. This lander needs more control effort, and is less accurate in tracking a reference velocity. It is also less accurate with landing on the designated landing spot.