Autonomous Sailing with Sim-to-Real Reinforcement Learning

Bink, Kiki

Title

Autonomous Sailing with Sim-to-Real Reinforcement Learning

Author

Bink, Kiki (TU Delft Mechanical, Maritime and Materials Engineering)

Contributor

Weymouth, Gabriel D. (mentor)
Negenborn, R.R. (graduation committee)
Gelling, J.L. (graduation committee)
Duz, Bulent (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Marine Technology | Ship Hydrodynamics

Date

2024-02-28

Abstract

Facing the critical challenge of reducing greenhouse gas (GHG) emissions in the maritime industry, this thesis explores the potential of smart control systems using Reinforcement Learning (RL) for autonomous sailing. Traditional sailing controls fall short in the complex, dynamic conditions of maritime environments. RL has been shown to be effective for continuous control in such conditions, but primarily in simulated environments. This study therefore aims to demonstrate the potential of RL for autonomous sailing control (ASC) through a small-scale project. A fast-time simulation of an Optimist is used to train the sailing controls required to reach an upwind target. The controls are then transferred to a robotized Optimist in a real-world environment to test the transferability of the simulation-trained controls. First, the reality gap, or modelling error, between the simulation and the real-world environment is quantified so that the performance of the techniques used to bridge it can be assessed. The sim-to-real techniques of Domain Randomization (DR) and the addition of observation noise (ON) are applied during training. To test the effectiveness of the trained RL controls, the best-performing ones in simulation are selected and tested in the real-world environment. The performance of the RL-controlled Optimist is compared to state-of-the-art controls in robotic sailing using two measures: success rate and the energy ratio, a physics-based metric of how efficiently the sailboat uses the power of the wind to propel itself. The results show that the RL controls are highly successful in the sailing simulation, but the transfer to the real world remains a major challenge. DR does improve the sim-to-real transfer, resulting in an agent that reaches a 100% success rate over 12 runs in the real-world environment.

Subject

Reinforcement Learning
Sailing
Machine Learning
Sim-to-Real
Autonomous Control

To reference this document use:

http://resolver.tudelft.nl/uuid:1284613a-d9e7-4076-b120-8349acdca4ca

Bibliographical note

Supplementary data and information about this thesis can be found here: https://www.marin.nl/en/research/artificial-intelligence-applications

Part of collection

Student theses

Document type

master thesis

Files

PDF

MSc_Thesis_AI_SAIL_Bink_K ... _FINAL.pdf

11.92 MB
