Balancing multidimensional morality and progression

Evaluating the tradeoff for artificial agents playing text-based games

Abstract

Morality is a fundamental concept guiding human decision-making. Given the growing role of large language models in society, it is necessary to ensure that they adhere to human principles, among which morality is of substantial importance. While research exists on artificial agents behaving morally, current state-of-the-art implementations treat morality as one-dimensional, failing to capture its complexity and nuance. To address this, a multidimensional representation of morality is proposed, with each dimension corresponding to a different moral foundation. The performance of three types of artificial agents tasked with choosing actions while playing text-based games is then compared and analysed. The first agent chooses only the most moral action, without aiming to win the games; the second prioritizes moral actions over game progression; and the third strives to win the games while also playing morally. The third agent outperforms the others in terms of game progression while also taking few immoral actions. However, the agent prioritizing morality over progression performs only slightly worse while taking no immoral actions, showing that artificial agents can perform well while also behaving morally.
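The three agent types described above can be sketched as action-selection policies over a multidimensional moral score. The sketch below is illustrative only and not the thesis's actual implementation: the foundation names, the `Action` structure, the aggregation by summation, and the `weight` parameter are all assumptions introduced here for clarity.

```python
from dataclasses import dataclass

# Hypothetical moral-foundation dimensions (assumed, not from the source)
FOUNDATIONS = ("care", "fairness", "loyalty", "authority", "sanctity")

@dataclass
class Action:
    name: str
    moral: dict        # score per foundation; negative values mark violations
    progression: float # estimated contribution toward winning the game

def moral_vector(**scores):
    # Fill in zero for any foundation not explicitly scored
    return {f: scores.get(f, 0.0) for f in FOUNDATIONS}

def morality(a: Action) -> float:
    # Aggregate the multidimensional profile; a plain sum is one simple choice
    return sum(a.moral[f] for f in FOUNDATIONS)

def moral_only(actions):
    # Agent 1: pick the most moral action, ignoring game progression
    return max(actions, key=morality)

def moral_first(actions):
    # Agent 2: lexicographic -- morality first, progression as tie-breaker
    return max(actions, key=lambda a: (morality(a), a.progression))

def progress_morally(actions, weight=0.5):
    # Agent 3: maximize progression while penalizing moral violations
    return max(actions, key=lambda a: a.progression + weight * morality(a))

candidates = [
    Action("help npc",  moral_vector(care=1.0),      progression=0.2),
    Action("steal key", moral_vector(fairness=-1.0), progression=1.0),
    Action("open door", moral_vector(),              progression=0.8),
]
```

On this toy candidate set, the first two agents choose `"help npc"` (the most moral action), while the third trades a little morality for progression and chooses `"open door"`, illustrating the tradeoff the abstract evaluates.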
