Key Insights from a Feature Discovery User Study

None, None; None, None; None, None; None, None

Key Insights from a Feature Discovery User Study

Conference Paper (2024)

Author(s)

Andra Ionescu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Zeger Mouw (Student TU Delft)

Efthimia Aivaloglou (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Asterios Katsifodimos (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Web Information Systems

DOI related publication

https://doi.org/10.1145/3665939.3665961 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:d2f447ba-0221-482d-8c2f-0ffbdd7a8895

More Info

expand_more

Publication Year

2024

Language

English

Research Group

Web Information Systems

ISBN (electronic)

9798400706936

Event

2024 Workshop on Human-In-the-Loop Data Analytics, HILDA 2024, Co-located with SIGMOD 2024, Santiago, Chile

Downloads counter

286

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Multiple works in data management research focus on automating the processes of data augmentation and feature discovery to save users from having to perform these tasks manually. Yet, this automation often leads to a disconnect with the users, as it fails to consider the specific needs and preferences of the actual end-users of data management systems for machine learning. To explore this issue further, we conducted 19 semi-structured, think-aloud use-case studies based on a scenario in which data specialists were tasked with augmenting a base table with additional features to train a machine learning model. In this paper, we share key insights into the practices of feature discovery on tabular data performed by real-world data specialists derived from our user study. Our research uncovered differences between the user assumptions reported in the literature and the actual practices, as well as some areas where literature and real-world practices align.

Files

3665939.3665961.pdf

(pdf | 0.701 Mb)