Towards Understanding Machine Learning Testing in Practise

Conference Paper (2023)
Author(s)

Arumoy Shome (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Luís Cruz (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Arie Van Deursen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Software Engineering
DOI related publication
https://doi.org/10.1109/CAIN58948.2023.00028 Final published version
More Info
expand_more
Publication Year
2023
Language
English
Research Group
Software Engineering
Pages (from-to)
117-118
ISBN (electronic)
9798350301137
Event
2nd IEEE/ACM International Conference on AI Engineering - Software Engineering for AI, CAIN 2023 (2023-05-15 - 2023-05-16), Melbourne, Australia
Downloads counter
305
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Visualisations drive all aspects of the Machine Learning (ML) Development Cycle but remain a vastly untapped resource by the research community. ML testing is a highly interactive and cognitive process which demands a human-in-the-loop approach. Besides writing tests for the code base, bulk of the evaluation requires application of domain expertise to generate and interpret visualisations. To gain a deeper insight into the process of testing ML systems, we propose to study visualisations of ML pipelines by mining Jupyter notebooks. We propose a two prong approach in conducting the analysis. First, gather general insights and trends using a qualitative study of a smaller sample of notebooks. And then use the knowledge gained from the qualitative study to design an empirical study using a larger sample of notebooks. Computational notebooks provide a rich source of information in three formats - text, code and images. We hope to utilise existing work in image analysis and Natural Language Processing for text and code, to analyse the information present in notebooks. We hope to gain a new perspective into program comprehension and debugging in the context of ML testing.

Files

Towards_Understanding_Machine_... (pdf)
(pdf | 0.827 Mb)
- Embargo expired in 01-01-2024
License info not available