Model predictive ship collision avoidance based on Q-learning beetle swarm antenna search and neural networks

None, None; None, None; None, None; None, None

Model predictive ship collision avoidance based on Q-learning beetle swarm antenna search and neural networks

Journal Article (2019)

Author(s)

Shuo Xie (TU Delft - Transport Engineering and Logistics, Wuhan University of Technology)

Vittorio Garofano (TU Delft - Transport Engineering and Logistics)

Xiumin Chu (Wuhan University of Technology)

Rudy R. Negenborn (Wuhan University of Technology, TU Delft - Transport Engineering and Logistics)

Research Group

Transport Engineering and Logistics

DOI related publication

https://doi.org/10.1016/j.oceaneng.2019.106609

Neural networks Collision avoidance Predictive control Beetle swarm antennas search Multi-ship encounters

To reference this document use:

https://resolver.tudelft.nl/uuid:810fe97f-3263-49d3-963d-63ada718cfb3

More Info

expand_more

Publication Year

2019

Language

English

Research Group

Transport Engineering and Logistics

Volume number

193

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Real-time collision avoidance with full consideration of ship maneuverability, collision risks and International Regulations for Preventing Collisions at Sea (COLREGs) is difficult in multi-ship encounters. To deal with this problem, a novel method is proposed based on model predictive control (MPC), an improved Q-learning beetle swarm antenna search (I-Q-BSAS) algorithm and neural networks. The main idea of this method is to use a neural network to approximate an inverse model based on decisions made with MPC for collision avoidance. Firstly, the predictive collision avoidance strategy is established following the MPC concept incorporating an I-Q-BSAS algorithm to solve the optimization problem. Meanwhile, the relative collision motion states in typical encounters are collected for training an inverse neural network model, which is used as an approximated optimal policy of MPC. Moreover, to deal with uncertain dynamics, the obtained policy is reinforced by long-term retraining based on an aggregation of on-policy and off-policy data. Ship collision avoidance in multi-ship encounters can be achieved by weighting the outputs of the neural network model with respect to different target ships. Simulation experiments under several typical and multi-ship encounters are carried out using the KVLCC2 ship model to verify the effectiveness of the proposed method.

Files

Paper.pdf

(pdf | 5.22 Mb)

- Embargo expired in 25-10-2021