Print Email Facebook Twitter Gender bias in word embeddings of different languages Title Gender bias in word embeddings of different languages Author Raijmakers, Thijs (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Viering, T.J. (mentor) Makrodimitris, S. (mentor) Naseri Jahfari, A. (mentor) Tax, D.M.J. (mentor) Loog, M. (mentor) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2020-06-21 Abstract Word embeddings are useful for various applications, such as sentiment classification (Tang et al., 2014), word translation (Xing, Wang, Liu, & Lin, 2015) and résumé parsing (Nasser, Sreejith, & Irshad, 2018). Previous research has determined that word embeddings contain gender bias, which can be problematic in certain applications such as résumé parsing. This research has addressed the question whether gender bias is present in word embeddings of different languages. Gender bias has been measured on word embedding of 26 different lan- guages with the help of the Word Embedding Association Test by Caliskan, Bryson, and Narayanan (2017). The results show that most of the tested languages seem to have bias towards male, while a few languages seem to have a bias towards female. This result is in line with previous literature. Subject Natural Language Processinggender biasWord embeddingWEATlanguage To reference this document use: http://resolver.tudelft.nl/uuid:ea118174-0a65-438a-a417-8461e246499a Part of collection Student theses Document type bachelor thesis Rights © 2020 Thijs Raijmakers Files PDF Gender_bias_between_word_ ... guages.pdf 552.6 KB Close viewer /islandora/object/uuid:ea118174-0a65-438a-a417-8461e246499a/datastream/OBJ/view