Fault Tolerant Control for Autonomous Surface Vehicles via Model Reference Reinforcement Learning

Conference Paper (2021)
Author(s)

Qingrui Zhang (Sun Yat-sen University)

Xinyu Zhang (Sun Yat-sen University)

Bo Zhu (Sun Yat-sen University)

V. Reppa (TU Delft - Transport Engineering and Logistics)

Research Group
Transport Engineering and Logistics
Copyright
© 2021 Qingrui Zhang, Xinyu Zhang, Bo Zhu, V. Reppa
DOI related publication
https://doi.org/10.1109/CDC45484.2021.9683461
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Qingrui Zhang, Xinyu Zhang, Bo Zhu, V. Reppa
Research Group
Transport Engineering and Logistics
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Pages (from-to)
1536-1541
ISBN (print)
978-1-6654-3659-5
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

A novel fault tolerant control algorithm is proposed in this paper based on model reference reinforcement learning for autonomous surface vehicles subject to sensor faults and model uncertainties. The proposed control scheme is a combination of a model-based control approach and a data-driven method, so it can leverage the advantages of both sides. The proposed design contains a baseline controller that ensures stable tracking performance at healthy conditions, a fault observer that estimates sensor faults, and a reinforcement learning module that learns to accommodate sensor faults using fault estimation and compensate for model uncertainties. The impact of sensor faults and model uncertainties can be effectively mitigated by this composite design. Stable tracking performance can also be ensured even at both the offline training and online implementation stages for the learning-based fault tolerant control. A numerical simulation with gyro sensor faults is presented to demonstrate the efficiency of the proposed algorithm.

Files

Fault_Tolerant_Control_for_Aut... (pdf)
(pdf | 3.01 Mb)
- Embargo expired in 01-08-2022
License info not available