Finding Shortcuts to a black-box model using Frequent Sequence Mining

Explaining Deep Learning models for Fact-Checking

Abstract

Deep-learning (DL) models could greatly advance the automation of fact-checking, yet they have not been widely adopted by the public because of their hard-to-explain nature. Although various techniques have been proposed to explain the behaviour of DL models locally, little attention has been paid to global explanations.
In response, we investigate whether a frequent sequence mining (FSM) tool can find sequence patterns that act as shortcuts to a state-of-the-art model in the context of fact-checking. By studying the connections between a model’s input and output, association rules (ARs) can serve as a global explanation for interpreting the model. The shortcuts were evaluated using a heuristic-based minimum support value; the strength of each rule was determined by its confidence, while the support value indicates a rule’s global coverage. Shortcuts help form an interpretation for creating counterfactual prompts, which can be used as a risk-assessment tool for DL models. Other applications of rule-based global explanations are left for future work.
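To make the support and confidence measures concrete, the following is a minimal sketch of mining sequence patterns as association rules over a model's inputs and outputs. All data, names, and thresholds here are illustrative assumptions, not the thesis's actual pipeline; contiguous n-grams stand in for a full FSM algorithm.

```python
from collections import Counter

# Toy corpus of (claim tokens, black-box model verdict) pairs.
# Hypothetical data: a real study would use model predictions on a
# fact-checking dataset.
data = [
    (["scientists", "say", "vaccines", "work"], "SUPPORTED"),
    (["experts", "say", "vaccines", "work"], "SUPPORTED"),
    (["sources", "say", "earth", "flat"], "REFUTED"),
    (["they", "say", "earth", "flat"], "REFUTED"),
]

def ngrams(tokens, n):
    """All contiguous length-n subsequences of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def mine_rules(data, n=2, min_support=0.25, min_confidence=0.9):
    """Return rules (pattern -> label) with their support and confidence.

    support(pattern)   = fraction of examples containing the pattern
    confidence(rule)   = P(label | pattern appears in the input)
    """
    pattern_counts = Counter()   # how many examples contain each pattern
    rule_counts = Counter()      # how many examples pair pattern with label
    for tokens, label in data:
        for pat in set(ngrams(tokens, n)):  # count once per example
            pattern_counts[pat] += 1
            rule_counts[(pat, label)] += 1
    total = len(data)
    rules = []
    for (pat, label), count in rule_counts.items():
        support = pattern_counts[pat] / total
        confidence = count / pattern_counts[pat]
        if support >= min_support and confidence >= min_confidence:
            rules.append((pat, label, support, confidence))
    return rules
```

On this toy corpus, the pattern `("earth", "flat")` yields the rule "earth flat → REFUTED" with support 0.5 and confidence 1.0: a candidate shortcut, since the verdict is predictable from the pattern alone, regardless of the rest of the claim.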
