A Two-Dimensional Explanation Framework to Classify AI as Incomprehensible, Interpretable, or Understandable

Conference Paper (2021)
Author(s)

Ruben S. Verhagen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Mark A. Neerincx (TU Delft - Electrical Engineering, Mathematics and Computer Science, TNO)

Myrthe L. Tielman (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Interactive Intelligence
DOI related publication
https://doi.org/10.1007/978-3-030-82017-6_8 Final published version
More Info
expand_more
Publication Year
2021
Language
English
Research Group
Interactive Intelligence
Bibliographical Note
Accepted author manuscript
Pages (from-to)
119-138
Publisher
Springer
ISBN (print)
978-3-030-82016-9
ISBN (electronic)
978-3-030-82017-6
Event
EXTRAAMAS 2021 (2021-05-03 - 2021-05-07), Virtual at London, United Kingdom
Downloads counter
326
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Because of recent and rapid developments in Artificial Intelligence (AI), humans and AI-systems increasingly work together in human-agent teams. However, in order to effectively leverage the capabilities of both, AI-systems need to be understandable to their human teammates. The branch of eXplainable AI (XAI) aspires to make AI-systems more understandable to humans, potentially improving human-agent teamwork. Unfortunately, XAI literature suffers from a lack of agreement regarding the definitions of and relations between the four key XAI-concepts: transparency, interpretability, explainability, and understandability. Inspired by both XAI and social sciences literature, we present a two-dimensional framework that defines and relates these concepts in a concise and coherent way, yielding a classification of three types of AI-systems: incomprehensible, interpretable, and understandable. We also discuss how the established relationships can be used to guide future research into XAI, and how the framework could be used during the development of AI-systems as part of human-AI teams.

Files

EXTRAAMAS2021_Verhagen_postpri... (pdf)
(pdf | 0.869 Mb)
- Embargo expired in 17-07-2022
License info not available