Do Joint Energy-Based Models Produce More Plausible Counterfactual Explanations?

None, None

Do Joint Energy-Based Models Produce More Plausible Counterfactual Explanations?

Bachelor Thesis (2024)

Author(s)

G. Pezzali (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

P. Altmeyer – Mentor (TU Delft - Multimedia Computing)

C.C.S. Liem – Mentor (TU Delft - Multimedia Computing)

B.J.W. Dudzik – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty

Electrical Engineering, Mathematics and Computer Science

Explainable AI Counterfactual Explanations Joint Energy-Based Models

To reference this document use:

https://resolver.tudelft.nl/uuid:afe2d50d-f4b3-403f-b0e7-a0b8ede96bb0

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

27-06-2024

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Counterfactual explanations (CEs) can be used to gain useful insights into the behaviour of opaque classification models, allowing users to make an informed decision when trusting such systems. Assuming the CEs of a model are faithful (they well represent the inner workings of the model), an explainable model generates plausible CEs (i.e. CEs fitting the real-world distribution of the data). This raises the question of whether classifiers explicitly designed to model the distribution of the data, such as energy-based models, are inherently more explainable. This work focuses on the evaluation of joint energy-based models (JEMs) in combination with the Energy-Constrained Conformal Counterfactuals (ECCCo) generator, with the goal of identifying if the generative capability of a model influences its explainability. Since ECCCo has been designed specifically to generate more faithful CEs, it makes it possible to use the CEs plausibility as a proxy of the model explainability. Two experiments have been performed to evaluate the effect of variations of generative capability within the same JEM architecture and the difference between JEMs and classically trained classifiers. Despite the experiments not having established a clear correlation between generative capability and explainability of a model, various research avenues are still open to explore in future works

Files

TUD_CSE_BSc_Research_Paper_Gia... (pdf)

(pdf | 1.51 Mb)

License info not available