A survey of actor-critic reinforcement learning: standard and natural policy gradients
Journal Article
(2012)
Author(s)
I. Grondman (TU Delft - OLD Intelligent Control & Robotics)
I.L. Busoniu (TU Delft - DISC)
G.A. Delgado Lopes (TU Delft - OLD Intelligent Control & Robotics)
R. Babuska (TU Delft - OLD Intelligent Control & Robotics)
Research Group
OLD Intelligent Control & Robotics
DOI related publication
https://doi.org/10.1109/TSMCC.2012.2218595
To reference this document use:
https://resolver.tudelft.nl/uuid:1215bcdc-0cdc-44a6-99a2-b92f1295f744
Publication Year
2012
Language
English
Issue number
6
Volume number
42
Pages (from-to)
1291-1307
Metadata-only record. No files are available for this record.