Searched for: +
(1 - 2 of 2)
document
Baldi, S. (author), Zhang, Z. (author), Liu, Di (author)
We propose a new reinforcement learning method in the framework of Recursive Least Squares-Temporal Difference (RLS-TD). Instead of using the standard mechanism of eligibility traces (resulting in RLS-TD((Formula presented.))), we propose to use the forgetting factor commonly used in gradient-based or least-square estimation, and we show that...
journal article 2022
document
Chen, Yong (author), Lv, Maolong (author), Baldi, S. (author), Liu, Zongcheng (author), Zhang, Wenqian (author), Zhou, Yang (author)
This work focuses on adaptive neural dynamic surface control (DSC) for an extended class of nonlinear MIMO strict-feedback systems whose control gain functions are continuous and possibly unbounded. The method is based on introducing a compact set which is eventually proved to be an invariant set: thanks to this set, the restrictive...
conference paper 2019