What Should You Know? A Human-In-the-Loop Approach to Unknown Unknowns Characterization in Image Recognition

None, None; None, None; None, None; None, None; None, None

What Should You Know? A Human-In-the-Loop Approach to Unknown Unknowns Characterization in Image Recognition

Conference Paper (2022)

Author(s)

S. Sharifi Noorian (TU Delft - Web Information Systems)

S. Qiu (TU Delft - Web Information Systems)

U.K. Gadiraju (TU Delft - Web Information Systems)

J Yang (TU Delft - Web Information Systems)

A. Bozzon (TU Delft - Sustainable Design Engineering, TU Delft - Human-Centred Artificial Intelligence)

Research Group

Web Information Systems

Copyright

DOI related publication

https://doi.org/10.1145/3485447.3512040

Unknown unknowns Humans in the loop Semantic analysis

To reference this document use:

https://resolver.tudelft.nl/uuid:2e9e89cf-f216-4a2e-94ea-19ec053ec342

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Research Group

Web Information Systems

Pages (from-to)

882-892

ISBN (electronic)

978-1-4503-9096-5

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Unknown unknowns represent a major challenge in reliable image recognition. Existing methods mainly focus on unknown unknowns identification, leveraging human intelligence to gather images that are potentially difficult for the machine. To drive a deeper understanding of unknown unknowns and more effective identification and treatment, this paper focuses on unknown unknowns characterization. We introduce a human-in-the-loop, semantic analysis framework for characterizing unknown unknowns at scale. We engage humans in two tasks that specify what a machine should know and describe what it really knows, respectively, both at the conceptual level, supported by information extraction and machine learning interpretability methods. Data partitioning and sampling techniques are employed to scale out human contributions in handling large data. Through extensive experimentation on scene recognition tasks, we show that our approach provides a rich, descriptive characterization of unknown unknowns and allows for more effective and cost-efficient detection than the state of the art.

Files

3485447.3512040.pdf

(pdf | 9.7 Mb)