Annotation Practices in Societally Impactful Machine Learning Applications

What are the recommender systems models actually trained on?

Bachelor Thesis (2023)
Author(s)

A.G. Sav (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Cynthia C.S. Liem – Mentor (TU Delft - Multimedia Computing)

Andrew Demetriou – Mentor (TU Delft - Multimedia Computing)

F. Broz – Graduation committee member (TU Delft - Interactive Intelligence)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Andra Sav
Publication Year
2023
Language
English
Graduation Date
28-06-2023
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Machine Learning models are nowadays infused into nearly every aspect of our lives. One of their most common applications is recommender systems, which facilitate users' decision-making in a wide range of scenarios (e.g., e-commerce, social media, news, and online learning). Training on large volumes of data is what ultimately enables such a system to provide meaningful recommendations, yet data collection and annotation practices for Machine Learning datasets lack standardization. This research paper systematically identifies and synthesizes these practices by examining the existing literature on recommender systems. The review covers the 100 most-cited papers from the most impactful venues in the Computing and Information Technology field. Multiple facets of the employed techniques are examined, such as reported human annotations and annotator diversity, label quality, and the public availability of training datasets.
Recurrent use of only a few benchmark datasets, poor documentation practices, and reproducibility issues in experiments are among the most striking findings of this study. The discussion centers on the need to move beyond reliance on algorithmic performance metrics alone and to prioritize data quality and fit instead. Finally, concerns are raised about biases and socio-psychological factors inherent in the datasets, and further exploration of addressing these early in the design of ML models is suggested.

Files

CSE3000_Final_Paper.pdf
(pdf | 0.179 MB)
License info not available