A Human-In-the-Loop Framework to Assess Multimodal Machine Learning Models


Abstract

Recent work explains DNN models for image classification tasks using the "attribution, human-in-the-loop, extraction" workflow. However, little work has applied such an approach to explaining DNN models for language or multimodal tasks. To address this gap, we propose a framework that explains and assesses models that use both categorical/numerical features and text, while optimizing the "attribution, human-in-the-loop, extraction" workflow. In particular, our framework accounts for limited human resources, especially when domain experts are required for the human-in-the-loop tasks, and provides insight into which subsets of the data those tasks should be applied to. We share the results of applying this framework to a multimodal transformer that performs text classification for compliance detection in the financial context.
