Natural Language Processing and Reinforcement Learning to Generate Morally
What is the optimal weight w to win the games while playing morally?
K.T.C. Boudier (TU Delft - Electrical Engineering, Mathematics and Computer Science)
E. Liscio – Mentor (TU Delft - Interactive Intelligence)
D. Mambelli – Mentor (TU Delft - Interactive Intelligence)
P.K. Murukannaiah – Mentor (TU Delft - Interactive Intelligence)
J. Yang – Graduation committee member (TU Delft - Web Information Systems)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
In our everyday life, people interact more and more with agents. However these agents often lack a moral sense and prioritize the accomplishment of the given task. In consequence, agents may unknowingly act immorally. Little research or progress has been done to endow agents with human morality and an internal sense of right and wrong. As of today, agents have a primitive representation of morality often represented as 1 value. In contrast, humans have multiple reasons to judge an action as moral. In hope of creating agents that are imbued with a more complex and human moral, we build upon the Jiminy Cricket environment. This preexisting environment has multiple games with diverse scenarios and the objective is to do the most moral action to maximize the reward