Crash reproduction difficulty, an initial assessment

None, None; None, None; None, None; None, None

Crash reproduction difficulty, an initial assessment

Journal Article (2020)

Author(s)

Boris Cherry (University of Namur)

Xavier Devroey (TU Delft - Software Engineering)

P. Derakhshanfar (TU Delft - Software Engineering)

Benoît Vanderose (University of Namur)

Research Group

Software Engineering

Copyright

Code quality Change metrics Search-based crash reproduction Software measurement

To reference this document use:

https://resolver.tudelft.nl/uuid:c41420bb-8db8-4ac0-98ed-2b054477f598

More Info

expand_more

Publication Year

2020

Language

English

Copyright

Research Group

Software Engineering

Volume number

2912

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This study presents the initial step towards a thorough analysis of the difficulty to reproduce a crash using searchbased crash reproduction. Traditionally, code size and complexity are considered representative indicators of the difficulty for search-based approaches, like search-based unit test generation, to generate tests. However, unlike unit test generation, crash reproduction does not seek to cover a set of behaviors but instead to generate one or more tests exercising a specific behavior reproducing a given crash. In this context, there is no guarantee that the indicators used for unit testing are still valid for crash reproduction. In this study, we seek to identify such indicators by considering various code metrics, code smells, and change metrics. We report our effort to collect those metrics for JCRASHPACK, a state-of-the-art crash reproduction benchmark, and an initial assessment by considering metrics individually. Our results show that although JCRASHPACK is larger than benchmarks used in previous studies, additional crashes should be added to improve its diversity and representativeness, and that no individual metric can be used to characterize the difficulty to reproduce a crash.

Files

Cherry_etal_final.pdf

(pdf | 0.307 Mb)