Gianluca DeMartini | TU Delft Repository

Plan-Then-Execute

An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant

Conference paper (2025) - G. He (author) , Gianluca DeMartini (author) , U.K. Gadiraju (author)

Since the explosion in popularity of ChatGPT, large language models (LLMs) have continued to impact our everyday lives. Equipped with external tools that are designed for a specific purpose (e.g., for flight booking or an alarm clock), LLM agents exercise an increasing capability ...

Editorial

Special Issue on Human in the Loop Data Curation

Journal article (2024) - Gianluca Demartini (author) , Shazia Sadiq (author) , J. Yang (author)

This Special Issue of the Journal of Data and Information Quality (JDIQ) contains novel theoretical and methodological contributions on data curation involving humans in the loop. In this editorial, we summarize the scope of the issue and briefly describe its content.

Workshop on Human-in-the-loop Data Curation

Conference paper (2022) - Gianluca Demartini (author) , J. Yang (author) , Shazia Sadiq (author)

Although data quality is a long-standing and enduring problem, it has recently received a resurgence of attention due to the fast proliferation of data analytics, machine learning, and decision-support applications built upon the wide-scale availability and accessibility of (big) ...

Crowd Worker Strategies in Relevance Judgment Tasks

Conference paper (2020) - Lei Han (author) , Eddy Maddalena (author) , Alessandro Checco (author) , Cristina Sarasua (author) , U.K. Gadiraju (author) , Kevin Roitero (author) , Gianluca DeMartini (author)

Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown how collecting high quality labels from a crowdsourcing platform can ...

CrowdCO-OP: Sharing Risks and Rewards in Crowdsourcing

Journal article (2020) - Shaoyang Fan (author) , Ujwal Gadiraju (author) , Alessandro Checco (author) , Gianluca DeMartini (author)

Paid micro-task crowdsourcing has gained in popularity partly due to the increasing need for large-scale manually labelled datasets which are often used to train and evaluate Artificial Intelligence systems. Modern paid crowdsourcing platforms use a piecework approach to rewards, ...

Scalpel-CD: Leveraging Crowdsourcing and Deep Probabilistic Modeling for Debugging Noisy Training Data

Book chapter (2019) - J Yang (author) , Alisa Smirnova (author) , Dingqi Yang (author) , Gianluca DeMartini (author) , Yuan Lu (author) , Philippe Cudré-Mauroux (author)

This paper presents Scalpel-CD, a first-of-its-kind system that leverages both human and machine intelligence to debug noisy labels from the training data of machine learning systems. Our system identifies potentially wrong labels using a deep probabilistic model, which is able t ...

Modeling Task Complexity in Crowdsourcing

Conference paper (2016) - J. Yang (author) , Judith Redi (author) , Gianluca DeMartini (author) , A Bozzon (author)

Complexity is crucial to characterize tasks performed by humans through computer systems. Yet, the theory and practice of crowdsourcing currently lacks a clear understanding of task complexity, hindering the design of effective and efficient execution interfaces or fair monetary ...