Imitation learning for an ASV path planner in complex marine environments

None, None

Imitation learning for an ASV path planner in complex marine environments

A feasibility study

Master Thesis (2025)

Author(s)

B. Rutteman (TU Delft - Mechanical Engineering)

Contributor(s)

Laura Ferranti – Mentor (TU Delft - Learning & Autonomous Control)

Jonathan Klein Schiphorst – Mentor

J. Alonso-Mora – Graduation committee member (TU Delft - Learning & Autonomous Control)

Faculty

Mechanical Engineering

Imitation Learning ASV Path planner

To reference this document use:

https://resolver.tudelft.nl/uuid:7ddfbf20-853e-4ac7-b888-e4e60bba1474

More Info

expand_more

Publication Year

2025

Language

English

Coordinates

52.00,4.38

Graduation Date

26-06-2025

Awarding Institution

Delft University of Technology

Programme

Mechanical Engineering | Vehicle Engineering | Cognitive Robotics

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This thesis proposes a study towards the application of imitation learning (IL) algorithms Action Chunking Transformer (ACT) and diffusion policy as an autonomous surface vessel (ASV) path planner in complex marine environments. Rationale for conduct- ing this research are the ubiquitous limitations regarding fine tuning cost- and or reward functions of conventional state of the art algorithms for ASV navigation in complex environments. In this study we trained both algorithm types on data sources collected from a basic 2D-grid sim- ulator and a more realistic GAZEBO simulator. Subse- quently we evaluated both algorithm’s performances in each respective simulator in terms of the success rate for a standard navigation task. We found relatively high suc- cess rates for the 2D-grid simulator (0.98 for ACT and 0.53 for diffusion policy). For the GAZEBO simulator, we found poor performance for ACT (0.0 success rate) and for diffusion policy we could not establish the perfor- mance due to hardware limitations. For future work, the capabilities of both models could be investigated further by trying to bridge the gap between the simple 2D simula- tor and the GAZEBO simulator. Mainly the effect of task complexity and the quality and quantity of data used for training the models on the performances of the models can be investigated in these future work studies.

Files

Thesis_report_Bas_Rutteman_.pd... (pdf)

(pdf | 3.37 Mb)

License info not available