Moral Embeddings: A closer look at their Performance, Generalizability and Transferability

Vecerdea, Dragos

Moral Embeddings: A closer look at their Performance, Generalizability and Transferability

Title

Moral Embeddings: A closer look at their Performance, Generalizability and Transferability

Author

Vecerdea, Dragos (TU Delft Electrical Engineering, Mathematics and Computer Science; TU Delft Intelligent Systems)

Contributor

Liscio, E. (mentor)
Murukannaiah, P.K. (mentor)
Marroquim, Ricardo (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Computer Science and Engineering

Project

CSE3000 Research Project

Date

2021-07-02

Abstract

Moral values are abstract ideas that ground our judgements towards what is right or wrong. However, with the rapid unfold of moral rhetoric on social media, it becomes increasingly important to place these ideas in a moral frame, contain their harmful effects, and recognise their positive ones. So far, estimating values from opinionated text has posed a challenge due to values' abstract and subjective nature. However, with the latest developments in Natural Language Processing (NLP), we foresee an opportunity to align the study of morality in text with state-of-the-art NLP architectures. Recently published, the Moral Foundations Tweeter Corpus is a milestone in moral classification tasks by offering a dataset that allows for a closer look into how people express moral narratives in social media. In the downstream process of a text classifier, embeddings convert words and sentences into meaningful vectors. Pre-trained on large corpora, they can be fine-tuned, and domain adapted. This study proposes a refinement model, starting from the available dataset, that learns to capture moral information in Sentence-BERT embeddings by applying a state-of-the-art supervised method (triplet loss). We further demonstrate how the refined embeddings improve the accuracy of moral classifiers. Finally, with an improvement of 5% F1-score over models that use pre-trained embeddings, we pave the way towards a generalisable and transferable set of moral embeddings.

Subject

Natural Language Processing
Text Classification
Moral Foundations Theory
Sentence-BERT
Triplet Loss

To reference this document use:

http://resolver.tudelft.nl/uuid:d4d2cd56-231d-4ef9-b2e9-9585a8313f83

Embargo date

2022-12-31

Part of collection

Student theses

Document type

bachelor thesis

Rights

Files

PDF

Research_Project_Final.pdf

383.17 KB

Close viewer