Detecting and Mitigating Bias in Machine Learning Image Data through Semantic Description of the Attention Mechanism

Use case: Gender Bias in Profession Prediction from Images

Abstract

Machine Learning models are increasingly used to assist or replace humans in a variety of decision-making domains, and serious concerns have been raised about the impact of these decisions on people's lives. In this work we focus on two main problems. The first is that such decision-making applications may discriminate between groups of people with respect to their protected attributes. The second is the lack of methods that interpret and explain the predictions of these Machine Learning systems in a human-understandable way. Specifically, we concentrate on one aspect of these applications: the training data used for classification. Most existing research addresses only individual aspects, such as detecting or mitigating bias, without attempting to explain the model's reasoning in a human-interpretable way. Moreover, the mitigation method most commonly used is to balance the distribution of the training data with respect to the protected attribute; as we show, however, this does not always solve the problem. Instead, we study three steps concurrently (detection, semantic interpretation, and mitigation of bias) to overcome these shortcomings and limitations, and we show that specific visual cues in the images lead to the observed bias. Finally, we perform an extensive evaluation of our method to verify its efficiency and effectiveness on the use case of Gender Bias in Profession Prediction from Images.
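To make concrete the baseline that the abstract argues is insufficient, the following is a minimal sketch of balancing training data with respect to a protected attribute by oversampling, so that each (class, protected-attribute) group is equally represented within its class. All names here (the record dicts, the "profession" and "gender" keys, the helper function) are illustrative assumptions, not the authors' implementation.

```python
# Sketch of the common balancing baseline: oversample so that, within each
# class label, every protected-attribute group reaches the size of the
# largest such group. Hypothetical data layout: a list of dicts.
import random
from collections import defaultdict

def balance_by_protected_attribute(records, label_key, protected_key, seed=0):
    """Return a resampled copy of `records` where, for each label, all
    protected-attribute groups have equal size (via oversampling)."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for r in records:
        groups[(r[label_key], r[protected_key])].append(r)

    balanced = []
    for label in {lbl for (lbl, _) in groups}:
        label_groups = [v for (lbl, _), v in groups.items() if lbl == label]
        target = max(len(v) for v in label_groups)
        for members in label_groups:
            balanced.extend(members)
            # Draw extra samples with replacement from minority groups.
            balanced.extend(rng.choices(members, k=target - len(members)))
    rng.shuffle(balanced)
    return balanced

# Toy example: "doctor" images are 80% male in the raw training data.
data = (
    [{"profession": "doctor", "gender": "male"}] * 80
    + [{"profession": "doctor", "gender": "female"}] * 20
)
balanced = balance_by_protected_attribute(data, "profession", "gender")
print(len(balanced))  # 160: now 80 male and 80 female doctor examples
```

Note that this equalizes group counts but leaves any biasing visual cues inside the images untouched, which is precisely the limitation the work highlights.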