Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web

A Survey of Technical Biases Informed by Psychology Literature

Journal Article (2021)
Author(s)

A.M.A. Balayn (TU Delft - Web Information Systems)

J. Yang (TU Delft - Web Information Systems)

Zoltán Szlávik (myTomorrows)

A. Bozzon (TU Delft - Human-Centred Artificial Intelligence)

Research Group
Web Information Systems
Copyright
© 2021 A.M.A. Balayn, J. Yang, Zoltán Szlávik, A. Bozzon
DOI related publication
https://doi.org/10.1145/3479158
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 A.M.A. Balayn, J. Yang, Zoltán Szlávik, A. Bozzon
Research Group
Web Information Systems
Issue number
3
Volume number
4
Pages (from-to)
11:1 - 11:56
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The automatic detection of conflictual languages (harmful, aggressive, abusive, and offensive languages) is essential to provide a healthy conversation environment on the Web. To design and develop detection systems that are capable of achieving satisfactory performance, a thorough understanding of the nature and properties of the targeted type of conflictual language is of great importance. The scientific communities investigating human psychology and social behavior have studied these languages in details, but their insights have only partially reached the computer science community.

In this survey, we aim both at systematically characterizing the conceptual properties of online conflictual languages, and at investigating the extent to which they are reflected in state-of-the-art automatic detection systems. Through an analysis of psychology literature, we provide a reconciled taxonomy that denotes the ensemble of conflictual languages typically studied in computer science. We then characterize the conceptual mismatches that can be observed in the main semantic and contextual properties of these languages and their treatment in computer science works; and systematically uncover resulting technical biases in the design of machine learning classification models and the dataset created for their training. Finally, we discuss diverse research opportunities for the computer science community and reflect on broader technical and structural issues.