PG
P. Garcia de Vinuesa Garcia
info
Please Note
<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>
2 records found
1
Uncertainty-Aware Hybrid Reinforcement Learning for Fault-Tolerant Flight Control
Hybrid Reinforcement Learning for the Flight Control System of a Cessna 550 Citation II
Master thesis
(2026)
-
P. Garcia de Vinuesa Garcia, E. van Kampen, Spilios Theodoulis, Xuerui Wang, I.Z. El-Hajj
Fault-tolerant flight control remains a major challenge as aircraft systems become increasingly autonomous and must operate under uncertain conditions and potential actuator failures. Reinforcement learning has shown strong potential for learning control policies directly from interaction with the environment, but purely offline-trained agents often lack the ability to adapt once deployed.
This thesis proposes a hybrid reinforcement learning framework that combines offline deep reinforcement learning with online adaptive control. A novel RUN-DSAC-IDHP controller is developed, integrating an uncertainty-aware offline policy learned with RUN-DSAC with an online Incremental Dual Heuristic Programming (IDHP) adaptation layer. The offline component provides a high-performance baseline policy, while the IDHP actor continuously adapts the control policy online when the aircraft dynamics change.
The proposed approach demonstrates how combining offline deep reinforcement learning with online adaptive control enables controllers to maintain strong baseline performance while adapting in real time to faults and changing flight conditions. ...
This thesis proposes a hybrid reinforcement learning framework that combines offline deep reinforcement learning with online adaptive control. A novel RUN-DSAC-IDHP controller is developed, integrating an uncertainty-aware offline policy learned with RUN-DSAC with an online Incremental Dual Heuristic Programming (IDHP) adaptation layer. The offline component provides a high-performance baseline policy, while the IDHP actor continuously adapts the control policy online when the aircraft dynamics change.
The proposed approach demonstrates how combining offline deep reinforcement learning with online adaptive control enables controllers to maintain strong baseline performance while adapting in real time to faults and changing flight conditions. ...
Fault-tolerant flight control remains a major challenge as aircraft systems become increasingly autonomous and must operate under uncertain conditions and potential actuator failures. Reinforcement learning has shown strong potential for learning control policies directly from interaction with the environment, but purely offline-trained agents often lack the ability to adapt once deployed.
This thesis proposes a hybrid reinforcement learning framework that combines offline deep reinforcement learning with online adaptive control. A novel RUN-DSAC-IDHP controller is developed, integrating an uncertainty-aware offline policy learned with RUN-DSAC with an online Incremental Dual Heuristic Programming (IDHP) adaptation layer. The offline component provides a high-performance baseline policy, while the IDHP actor continuously adapts the control policy online when the aircraft dynamics change.
The proposed approach demonstrates how combining offline deep reinforcement learning with online adaptive control enables controllers to maintain strong baseline performance while adapting in real time to faults and changing flight conditions.
This thesis proposes a hybrid reinforcement learning framework that combines offline deep reinforcement learning with online adaptive control. A novel RUN-DSAC-IDHP controller is developed, integrating an uncertainty-aware offline policy learned with RUN-DSAC with an online Incremental Dual Heuristic Programming (IDHP) adaptation layer. The offline component provides a high-performance baseline policy, while the IDHP actor continuously adapts the control policy online when the aircraft dynamics change.
The proposed approach demonstrates how combining offline deep reinforcement learning with online adaptive control enables controllers to maintain strong baseline performance while adapting in real time to faults and changing flight conditions.
Bachelor thesis
(2023)
-
D. Çelebi, M.N.L. de Bruijn, J. de Wit, P. Garcia de Vinuesa Garcia, O.P. Heukelom, J. Louro Fuentes, M.A. Rombouts, M.H. van Opstal, J.I.J. Verhoeff, M.J. Wolf, B.D.W. Remes, Bianca Giovanardi, PhD, S.H.J. Westerbeek