Towards Understanding Machine Learning Testing in Practise

None, None; None, None; None, None

Towards Understanding Machine Learning Testing in Practise

Conference Paper (2023)

Author(s)

Arumoy Shome (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Luís Cruz (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Arie Van Deursen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Software Engineering

Natural Language Processing Data Mining Image Analysis AI Engineering Computational Notebooks Machine Learning Testing NLP for Code

DOI related publication

https://doi.org/10.1109/CAIN58948.2023.00028 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:6f391be2-a267-48c3-878b-3164bfeb7279

More Info

expand_more

Publication Year

2023

Language

English

Research Group

Software Engineering

Pages (from-to)

117-118

ISBN (electronic)

9798350301137

Event

2nd IEEE/ACM International Conference on AI Engineering - Software Engineering for AI, CAIN 2023 (2023-05-15 - 2023-05-16), Melbourne, Australia

Downloads counter

331

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Visualisations drive all aspects of the Machine Learning (ML) Development Cycle but remain a vastly untapped resource by the research community. ML testing is a highly interactive and cognitive process which demands a human-in-the-loop approach. Besides writing tests for the code base, bulk of the evaluation requires application of domain expertise to generate and interpret visualisations. To gain a deeper insight into the process of testing ML systems, we propose to study visualisations of ML pipelines by mining Jupyter notebooks. We propose a two prong approach in conducting the analysis. First, gather general insights and trends using a qualitative study of a smaller sample of notebooks. And then use the knowledge gained from the qualitative study to design an empirical study using a larger sample of notebooks. Computational notebooks provide a rich source of information in three formats - text, code and images. We hope to utilise existing work in image analysis and Natural Language Processing for text and code, to analyse the information present in notebooks. We hope to gain a new perspective into program comprehension and debugging in the context of ML testing.

Files

Towards_Understanding_Machine_... (pdf)

(pdf | 0.827 Mb)

- Embargo expired in 01-01-2024

License info not available