Guided Metamorphic Transformations for Testing the Robustness of Trained Code2Vec Models

Marang, Ruben

Guided Metamorphic Transformations for Testing the Robustness of Trained Code2Vec Models

Title

Guided Metamorphic Transformations for Testing the Robustness of Trained Code2Vec Models

Author

Marang, Ruben (TU Delft Electrical Engineering, Mathematics and Computer Science)

Contributor

Panichella, A. (mentor)
Applis, L.H. (mentor)
van Deursen, A. (graduation committee)
Erkin, Z. (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Computer Science

Date

2022-08-30

Abstract

Machine learning models are increasingly being used within software engineering for their predictions. Research shows that these models’ performance is increasing with new research. This thesis focuses on models for method name prediction, for which the goal is to have a model that can accurately predict method names. With this thesis, we could create a tool that can suggest method names to software developers, which would assist in improving the quality of the projects.
This research aims to get insight into the robustness vulnerabilities of a method name prediction model. We use a genetic search algorithm that looks for these robustness problems. The main question this thesis tries to answer is to what extent the performance metrics are affected by applying metamorphic transformations to the test set of a trained code2vec model. Besides this, this thesis also proposes an alternative metric called percentage MRR, which might better reflect the robustness of a model. The main idea behind this metric is that it penalizes the prediction certainty of a model instead of penalizing the prediction rank.
To answer this research question, a tool is created that runs a genetic algorithm applying these metamorphic transformations to a dataset that a trained model is then evaluating. With this tool, we conducted 22 genetic search experiments on primary metrics and combinations of metrics to see the trade-offs in the Pareto fronts. The guided search of applying metamorphic transformations on the test set results in an average performance decrease of around 19%. This thesis also compares this drop in performance to the performance decrease a random search algorithm would create. Notably, for every transformer added, the average decrease in performance becomes smaller, and there are transformations, e.g., the if-false-else transformation, that have a bigger effect than others. This thesis concludes that the trained model is not robust against metamorphic transformations and has a significant performance drop.

Subject

Genetic algorithm
Machine learning
Metamorphic testing
Metamoprhic transformations
Neural network
Genetic search
Robustness
Evaluation framework
software engineering

To reference this document use:

http://resolver.tudelft.nl/uuid:1d6149f2-353a-4e8a-8f21-7874346cac91

Part of collection

Student theses

Document type

master thesis

Rights

Files

PDF

Master_thesis_Ruben_Marang.pdf

12.98 MB

Close viewer