Crowd Worker Strategies in Relevance Judgment Tasks

Abstract

Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown that collecting high-quality labels from a crowdsourcing platform can be challenging. Existing quality assurance techniques focus on answer aggregation or on the use of gold questions, where ground-truth data makes it possible to check the quality of responses. In this paper, we present qualitative and quantitative results revealing how different crowd workers adopt different work strategies to complete relevance judgment tasks efficiently, and the consequent impact of these strategies on quality. We delve into the techniques and tools that highly experienced crowd workers use to be more efficient in completing crowdsourcing micro-tasks. To this end, we use qualitative results from worker interviews and surveys, as well as the results of a data-driven study of behavioral log data (i.e., clicks, keystrokes, and keyboard shortcuts) collected from crowd workers performing relevance judgment tasks. Our results highlight the presence of frequently used shortcut patterns that can speed up task completion, thus increasing the hourly wage of efficient workers. We observe how differences in crowd work experience result in different working strategies, productivity levels, and quality and diversity of the crowdsourced judgments.