Explainable artificial intelligence in forensics: Realistic explanations for number of contributor predictions of DNA profiles
M.S. Veldhuis (Nederlands Forensisch Instituut (NFI), Student TU Delft)
Simone Ariëns (Nederlands Forensisch Instituut (NFI))
Rolf J.F. Ypma (Nederlands Forensisch Instituut (NFI))
T.E.P.M.F. Abeel (TU Delft - Pattern Recognition and Bioinformatics)
Corina C.G. Benschop (Nederlands Forensisch Instituut (NFI))
Abstract
Machine learning models achieve good accuracy in determining the number of contributors (NOC) in short tandem repeat (STR) mixture DNA profiles. However, the models used so far are not understandable to users, as they only output a prediction without any reasoning for that conclusion. We therefore leverage techniques from the field of explainable artificial intelligence (XAI) to help users understand why specific predictions are made. Where previous attempts at explainability for NOC estimation have relied on simpler, more understandable models that achieve lower accuracy, we use techniques that can be applied to any machine learning model. Our explanations combine SHAP values and counterfactual examples for each prediction into a single visualization. Existing methods for generating counterfactuals assume uncorrelated features. This makes them inappropriate for the highly correlated features derived from STR data for NOC estimation, as these techniques simulate combinations of features that could not have resulted from an STR profile. For this reason, we have constructed a new counterfactual method, Realistic Counterfactuals (ReCo), which generates realistic counterfactual explanations for correlated data. We show that ReCo outperforms state-of-the-art methods on traditional metrics, as well as on a novel realism score. A user evaluation of the visualization shows that end-users view the explanations positively, which is ultimately the most appropriate metric for assessing explanations in real-world settings.
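The abstract refers to SHAP values as per-prediction feature attributions. The sketch below is purely illustrative and is not the authors' pipeline or ReCo: it trains a toy classifier on made-up stand-ins for STR-derived features (the feature names, data, and labels are assumptions) and computes SHAP values for a single prediction using the shap library.

```python
import numpy as np
import shap
from sklearn.ensemble import RandomForestClassifier

# Hypothetical stand-ins for STR-derived features; real NOC features
# (e.g., allele counts and peak-height statistics per locus) are not reproduced here.
rng = np.random.default_rng(0)
n = 500
max_alleles = rng.integers(2, 9, size=n)            # max allele count at any locus
mean_alleles = max_alleles - rng.random(n) * 2      # deliberately correlated with the max
total_peak_height = rng.normal(5000, 1500, size=n)  # overall signal intensity
X = np.column_stack([max_alleles, mean_alleles, total_peak_height])
y = np.clip((max_alleles + 1) // 2, 1, 4)           # toy NOC label (1-4 contributors)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# SHAP values attribute one profile's prediction to its individual features;
# for a multi-class model there is one set of attributions per class.
explainer = shap.TreeExplainer(model)
sv = explainer.shap_values(X[:1])
print("predicted NOC:", model.predict(X[:1]), "SHAP value shape:", np.shape(sv))
```

In the paper's setting, such per-feature contributions are shown alongside counterfactual examples (here they would be alternative, realistic feature values that change the predicted NOC) in a single visualization.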