Print Email Facebook Twitter Guided Metamorphic Transformations for Testing the Robustness of Trained Code2Vec Models Title Guided Metamorphic Transformations for Testing the Robustness of Trained Code2Vec Models Author Marang, Ruben (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Panichella, A. (mentor) Applis, L.H. (mentor) van Deursen, A. (graduation committee) Erkin, Z. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science Date 2022-08-30 Abstract Machine learning models are increasingly being used within software engineering for their predictions. Research shows that these models’ performance is increasing with new research. This thesis focuses on models for method name prediction, for which the goal is to have a model that can accurately predict method names. With this thesis, we could create a tool that can suggest method names to software developers, which would assist in improving the quality of the projects.This research aims to get insight into the robustness vulnerabilities of a method name prediction model. We use a genetic search algorithm that looks for these robustness problems. The main question this thesis tries to answer is to what extent the performance metrics are affected by applying metamorphic transformations to the test set of a trained code2vec model. Besides this, this thesis also proposes an alternative metric called percentage MRR, which might better reflect the robustness of a model. The main idea behind this metric is that it penalizes the prediction certainty of a model instead of penalizing the prediction rank.To answer this research question, a tool is created that runs a genetic algorithm applying these metamorphic transformations to a dataset that a trained model is then evaluating. With this tool, we conducted 22 genetic search experiments on primary metrics and combinations of metrics to see the trade-offs in the Pareto fronts. The guided search of applying metamorphic transformations on the test set results in an average performance decrease of around 19%. This thesis also compares this drop in performance to the performance decrease a random search algorithm would create. Notably, for every transformer added, the average decrease in performance becomes smaller, and there are transformations, e.g., the if-false-else transformation, that have a bigger effect than others. This thesis concludes that the trained model is not robust against metamorphic transformations and has a significant performance drop. Subject Genetic algorithmMachine learningMetamorphic testingMetamoprhic transformationsNeural networkGenetic searchRobustnessEvaluation frameworksoftware engineering To reference this document use: http://resolver.tudelft.nl/uuid:1d6149f2-353a-4e8a-8f21-7874346cac91 Part of collection Student theses Document type master thesis Rights © 2022 Ruben Marang Files PDF Master_thesis_Ruben_Marang.pdf 12.98 MB Close viewer /islandora/object/uuid:1d6149f2-353a-4e8a-8f21-7874346cac91/datastream/OBJ/view