BayLIME: Bayesian local interpretable model-agnostic explanations
Xingyu Zhao (Heriot-Watt University)
Wei Huang (University of Liverpool)
Xiaowei Huang (University of Liverpool)
Valentin Robu (TU Delft - Algorithmics)
David Flynn (Heriot-Watt University)
Abstract
Given the pressing need for assuring algorithmic transparency, Explainable AI (XAI) has emerged as one of the key areas of AI research. In this paper, we develop BayLIME, a novel Bayesian extension to the LIME framework, one of the most widely used approaches in XAI. Compared to LIME, BayLIME exploits prior knowledge and Bayesian reasoning to improve both the consistency of repeated explanations of a single prediction and the robustness to kernel settings. BayLIME also exhibits better explanation fidelity than the state-of-the-art (LIME, SHAP and Grad-CAM) thanks to its ability to integrate prior knowledge from, e.g., a variety of other XAI techniques, as well as verification and validation (V&V) methods. We demonstrate the desirable properties of BayLIME through both theoretical analysis and extensive experiments.
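To make the core idea concrete, below is a minimal, hypothetical sketch of the technique the abstract describes: replacing LIME's weighted least-squares local surrogate with a Bayesian linear model whose prior can encode knowledge from other sources. This is not the authors' released implementation; the function name bayesian_local_surrogate, the tabular Gaussian perturbation scheme, and the use of scikit-learn's BayesianRidge (with its alpha_init/lambda_init prior parameters) are illustrative assumptions.

    # Illustrative sketch only -- not the authors' BayLIME code.
    import numpy as np
    from sklearn.linear_model import BayesianRidge

    rng = np.random.default_rng(0)

    def bayesian_local_surrogate(f, x, n_samples=1000, kernel_width=0.75,
                                 alpha_init=None, lambda_init=None):
        """Fit a Bayesian linear surrogate around instance x.

        f           : black-box model, maps an (n, d) array to (n,) scores
        x           : 1-D instance to explain
        alpha_init,
        lambda_init : optional prior precisions, e.g. distilled from another
                      XAI or V&V method (the 'prior knowledge' idea)
        """
        d = x.shape[0]
        # Perturb around x, as LIME does for tabular data.
        Z = x + rng.normal(scale=1.0, size=(n_samples, d))
        y = f(Z)
        # Exponential kernel weights on distance to x (LIME's default shape).
        dist = np.linalg.norm(Z - x, axis=1)
        w = np.exp(-(dist ** 2) / kernel_width ** 2)
        # Bayesian ridge regression in place of LIME's weighted least squares;
        # an informative prior regularises repeated explanations toward
        # consistency and dampens sensitivity to the kernel width.
        kwargs = {}
        if alpha_init is not None and lambda_init is not None:
            kwargs = dict(alpha_init=alpha_init, lambda_init=lambda_init)
        reg = BayesianRidge(**kwargs)
        reg.fit(Z - x, y, sample_weight=w)
        # Posterior mean coefficients serve as feature attributions;
        # sigma_ is their posterior covariance (an uncertainty estimate).
        return reg.coef_, reg.sigma_

    # Toy usage: explain a quadratic black box at a point.
    f = lambda Z: (Z ** 2).sum(axis=1)
    coef, cov = bayesian_local_surrogate(f, np.array([1.0, -2.0, 0.5]))
    print(coef)

Because the posterior combines the prior with the weighted samples, repeated runs with different random perturbations are pulled toward the same prior mean, which is the mechanism behind the consistency and kernel-robustness properties claimed above.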