A survey of actor-critic reinforcement learning: standard and natural policy gradients

Journal Article (2012)
Author(s)

I. Grondman (TU Delft - OLD Intelligent Control & Robotics)

I.L. Busoniu (TU Delft - DISC)

G.A. Delgado Lopes (TU Delft - OLD Intelligent Control & Robotics)

R Babuska (TU Delft - OLD Intelligent Control & Robotics)

Research Group
OLD Intelligent Control & Robotics
DOI related publication
https://doi.org/10.1109/TSMCC.2012.2218595
More Info
expand_more
Publication Year
2012
Language
English
Research Group
OLD Intelligent Control & Robotics
Issue number
6
Volume number
42
Pages (from-to)
1291-1307

No files available

Metadata only record. There are no files for this record.