A counterfactual-based evaluation framework for machine learning models that use gene expression data

Master Thesis (2024)
Author(s)

M.E. Radder (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C.C.S. Liem – Mentor (TU Delft - Multimedia Computing)

P. Altmeyer – Mentor (TU Delft - Multimedia Computing)

T.E.P.M.F. Abeel – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2024
Language
English
Graduation Date
15-10-2024
Awarding Institution
Delft University of Technology
Programme
Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The evaluation metrics commonly used for machine learning models often fail to adequately reveal the inner workings of the models, which is particularly necessary in critical fields such as healthcare. Explainable AI techniques, such as counterfactual explanations, offer a way to uncover a model’s internal process. However, in the literature these explanations are often used for recourse actions rather than for testing a model’s internal mechanism. In this paper, we propose a proof of concept for a framework that uses counterfactual explanations to evaluate the inner workings of biological machine learning models that use gene expression data. Our approach involves comparing the change in gene expression observed in the original data to the change in gene expression observed between the factual and counterfactual data. The change in gene expression is quantified using the log fold change. Additionally, we expand the definition of faithfulness and introduce a new metric that measures how faithfully the generated counterfactual explanations represent the model. This metric should ensure that the explanations accurately reflect the model’s true internal process.
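To illustrate the comparison described above, the following minimal Python sketch computes per-gene log2 fold changes both between two groups in the original data and between a factual sample and its counterfactual, then compares the two profiles. All names, example values, the pseudocount, and the correlation-based agreement score are illustrative assumptions, not the thesis's actual implementation or metric.

import numpy as np

def log_fold_change(expr_a, expr_b, pseudocount=1.0):
    """Per-gene log2 fold change between two expression vectors.

    A pseudocount is added to avoid division by zero; this is a common
    convention and an assumption here, not necessarily the thesis's choice.
    """
    expr_a = np.asarray(expr_a, dtype=float)
    expr_b = np.asarray(expr_b, dtype=float)
    return np.log2((expr_b + pseudocount) / (expr_a + pseudocount))

# Hypothetical example: per-gene mean expression for two classes in the
# original data, plus a factual sample and its generated counterfactual.
mean_class_0 = np.array([12.0, 3.5, 80.0])    # e.g. samples labelled class 0
mean_class_1 = np.array([48.0, 3.4, 10.0])    # e.g. samples labelled class 1
factual = np.array([14.0, 3.6, 75.0])         # sample predicted as class 0
counterfactual = np.array([45.0, 3.5, 12.0])  # perturbed to flip the prediction

lfc_data = log_fold_change(mean_class_0, mean_class_1)
lfc_cf = log_fold_change(factual, counterfactual)

# One simple way to compare the two profiles: Pearson correlation.
# High agreement would suggest the model's decision relies on expression
# shifts that are also observed in the data itself.
agreement = np.corrcoef(lfc_data, lfc_cf)[0, 1]
print("LFC (original data):", lfc_data)
print("LFC (factual vs counterfactual):", lfc_cf)
print("Agreement (Pearson r):", round(agreement, 3))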

Files

Thesis_Marit_Radder.pdf
(pdf | 4.14 Mb)
License info not available