Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

None, None; None, None; None, None; None, None; None, None

Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

Conference Paper (2018)

Author(s)

A.M.A. Balayn (Student TU Delft, IBM Nederland)

P. Mavridis (TU Delft - Web Information Systems)

A. Bozzon (TU Delft - Web Information Systems)

Benjamin Timmermans (IBM Nederland)

Zoltán Szlávik (IBM Nederland)

Research Group

Web Information Systems

Copyright

Crowdsourcing Dataset bias Machine Learning fairness Annotation aggregation

To reference this document use:

https://resolver.tudelft.nl/uuid:43f84e8d-71f7-4379-84a6-b6cac86253e5

More Info

expand_more

Publication Year

2018

Language

English

Copyright

Research Group

Web Information Systems

Volume number

2276

Pages (from-to)

67-71

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Training machine learning (ML) models for natural language processing usually requires large amount of data, often acquired through crowdsourcing. The way this data is collected and aggregated can have an effect on the outputs of the trained model such as ignoring the labels which differ from the majority. In this paper we investigate how label aggregation can bias the ML results towards certain data samples and propose a methodology to highlight and mitigate this bias. Although our work is applicable to any kind of label aggregation for data subject to multiple interpretations, we focus on the effects of the bias introduced by majority voting on toxicity prediction over sentences. Our preliminary results point out that we can mitigate the majority-bias and get increased prediction accuracy for the minority opinions if we take into account the different labels from annotators when training adapted models, rather than rely on the aggregated labels.

Files

Paper7.pdf

(pdf | 0.392 Mb)

License info not available