Learning Off-By-One Mistakes: An Empirical Study on Different Deep Learning Models

Master Thesis (2020)
Author(s)

H. Sellik (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Mauricio Aniche – Mentor (TU Delft - Software Engineering)

Onno van van Paridon – Coach (Adyen B.V.)

C. Hauff – Graduation committee member (TU Delft - Web Information Systems)

Arie Van Deursen – Graduation committee member (TU Delft - Software Technology)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2020 Hendrig Sellik
More Info
expand_more
Publication Year
2020
Language
English
Copyright
© 2020 Hendrig Sellik
Graduation Date
22-09-2020
Awarding Institution
Delft University of Technology
Programme
['Computer Science']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Mistakes in binary conditions are a source of error in many software systems. They happen when developers use < or > instead of <= or >=. These boundary mistakes are hard to find for developers and pose a manual labor-intensive work. While researches have been proposing solutions to identify errors in boundary conditions, the problem remains a challenge. In this thesis, we propose deep learning models to learn mistakes in boundary conditions and train our model on approximately 1.6M examples with faults in different boundary conditions. We achieve an accuracy of 85.06%, a precision of 85.23% and a recall of 84.82% on a controlled dataset. Additionally, we perform tests on 41 real-world boundary condition bugs found from GitHub and try to find bugs from the Java project of Adyen. However, the false-positive rate of the model remains an issue. We hope that this work paves the way for future developments in using deep learning models for defect prediction.

Files

License info not available