M. Peschl
2 records found
Authored
Aligning AI with Human Norms: Multi-Objective Deep Reinforcement Learning with Active Preference Elicitation
The field of deep reinforcement learning has seen major successes recently, achieving superhuman performance in discrete games such as Go and the Atari domain, as well as astounding results in continuous robot locomotion tasks. However, the correct specification of human intentio
...
We propose a deep reinforcement learning algorithm that employs an adversarial training strategy for adhering to implicit human norms alongside optimizing for a narrow goal objective. Previous methods which incorporate human values into reinforcement learning algorithms either sc
...