Automated Classification of Overfitting Patches With Statically Extracted Code Features

Journal article (2022)

Authors

He Ye KTH Royal Institute of Technology

Jian Gu KTH Royal Institute of Technology

Matias Martinez Université de Valenciennes et du Hainaut Cambrésis

T. Durieux KTH Royal Institute of Technology

Martin Monperrus KTH Royal Institute of Technology

Affiliation

External organisation

Automatic program repair Code features Overfitting patch Patch assessment

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:4a9b25a1-418f-4a2d-83cc-ab28b41c941f

Published Date

01-08-2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Affiliation

External organisation

Abstract

Automatic program repair (APR) aims to reduce the cost of manually fixing software defects. However, APR suffers from generating a multitude of overfitting patches, those patches that fail to correctly repair the defect beyond making the tests pass. This paper presents a novel overfitting patch detection system called ODS to assess the correctness of APR patches. ODS first statically compares a patched program and a buggy program in order to extract code features at the abstract syntax tree (AST) level, for the single programming language Java. Then, ODS uses supervised learning with the captured code features and patch correctness labels to automatically learn a probabilistic model. The learned ODS model can then finally be applied to classify new and unseen program repair patches. We conduct a large-scale experiment to evaluate the effectiveness of ODS on patch correctness classification based on 10,302 patches from Defects4J, Bugs.jar and Bears benchmarks. The empirical evaluation shows that ODS is able to correctly classify 71.9 percent of program repair patches from 26 projects, which improves the state-of-the-art. ODS is applicable in practice and can be employed as a post-processing procedure to classify the patches generated by different APR systems.