2 records found
1
Interaction of Instrumental and Goal-directed Learning Modulates Prediction Error Representations in the Ventral Striatum
Non-deterministic policy improvement stabilizes approximated reinforcement learning