Print Email Facebook Twitter Moral Embeddings: A closer look at their Performance, Generalizability and Transferability Title Moral Embeddings: A closer look at their Performance, Generalizability and Transferability Author Vecerdea, Dragos (TU Delft Electrical Engineering, Mathematics and Computer Science; TU Delft Intelligent Systems) Contributor Liscio, E. (mentor) Murukannaiah, P.K. (mentor) Marroquim, Ricardo (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2021-07-02 Abstract Moral values are abstract ideas that ground our judgements towards what is right or wrong. However, with the rapid unfold of moral rhetoric on social media, it becomes increasingly important to place these ideas in a moral frame, contain their harmful effects, and recognise their positive ones. So far, estimating values from opinionated text has posed a challenge due to values' abstract and subjective nature. However, with the latest developments in Natural Language Processing (NLP), we foresee an opportunity to align the study of morality in text with state-of-the-art NLP architectures. Recently published, the Moral Foundations Tweeter Corpus is a milestone in moral classification tasks by offering a dataset that allows for a closer look into how people express moral narratives in social media. In the downstream process of a text classifier, embeddings convert words and sentences into meaningful vectors. Pre-trained on large corpora, they can be fine-tuned, and domain adapted. This study proposes a refinement model, starting from the available dataset, that learns to capture moral information in Sentence-BERT embeddings by applying a state-of-the-art supervised method (triplet loss). We further demonstrate how the refined embeddings improve the accuracy of moral classifiers. Finally, with an improvement of 5% F1-score over models that use pre-trained embeddings, we pave the way towards a generalisable and transferable set of moral embeddings. Subject Natural Language ProcessingText ClassificationMoral Foundations TheorySentence-BERTTriplet Loss To reference this document use: http://resolver.tudelft.nl/uuid:d4d2cd56-231d-4ef9-b2e9-9585a8313f83 Embargo date 2022-12-31 Part of collection Student theses Document type bachelor thesis Rights © 2021 Dragos Vecerdea Files PDF Research_Project_Final.pdf 383.17 KB Close viewer /islandora/object/uuid:d4d2cd56-231d-4ef9-b2e9-9585a8313f83/datastream/OBJ/view