Search results | TU Delft Repositories

Searched for: %2520

(1 - 1 of 1)

document: Eligibility traces and forgetting factor in recursive least-squares-based temporal difference
Baldi, S. (author), Zhang, Z. (author), Liu, Di (author)
We propose a new reinforcement learning method in the framework of Recursive Least Squares-Temporal Difference (RLS-TD). Instead of using the standard mechanism of eligibility traces (resulting in RLS-TD((Formula presented.))), we propose to use the forgetting factor commonly used in gradient-based or least-square estimation, and we show that...
journal article 2022