How do you feel?

None, None; None, None; None, None; None, None; None, None

How do you feel?

Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection

Conference Paper (2023)

Author(s)

Philippe Lammerts (Student TU Delft)

P. Lippmann (TU Delft - Web Information Systems)

Yen-Chia Hsu (Universiteit van Amsterdam)

Fabio Casati (ServiceNow)

Jie Yang (TU Delft - Web Information Systems)

Research Group

Web Information Systems

Copyright

DOI related publication

https://doi.org/10.1145/3600211.3604655

Crowdsourcing Human-in-the-loop Rejection Hate speech Machine confidence Value-sensitive machine learning

To reference this document use:

https://resolver.tudelft.nl/uuid:5fc082f5-0993-4136-a6f9-576294c6b275

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Research Group

Web Information Systems

Pages (from-to)

834-844

ISBN (print)

979-8-4007-0231-0

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the strengths of humans' reliability and the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions regarding machine decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as occurrences where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation can provide a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) determining when to accept or reject machine decisions to obtain the optimal total value a model can deliver and 2) selecting better classification models as compared to the more widely used target of model accuracy.

Files

3600211.3604655.pdf

(pdf | 0.93 Mb)