Data-Driven Extract Method Recommendations: A Study at ING

None, None; None, None; None, None; None, None; None, None; None, None

Data-Driven Extract Method Recommendations: A Study at ING

Conference Paper (2021)

Author(s)

David van der Leij (Student TU Delft, ING Bank)

J.R. Binda (ING Bank)

Robbert van Dalen (ING Bank)

Pieter Vallen (ING Bank)

Yaping Luo (ING Bank, Eindhoven University of Technology)

Maurício Aniche (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Software Engineering

Software Engineering Machine Learning for Software Engineering Software Refactoring

DOI related publication

https://doi.org/10.1145/3468264.3473927 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:4470f6e1-6e3b-4112-86a3-dbe9a421aec3

More Info

expand_more

Publication Year

2021

Language

English

Research Group

Software Engineering

Pages (from-to)

1337-1347

ISBN (electronic)

978-1-4503-8562-6

Event

ESEC/FSE 2021: 29th ACM Joint European<br/>Software Engineering Conference and Symposium on the Foundations of Software<br/>Engineering (2021-08-23 - 2021-08-28), athens, Greece

Downloads counter

317

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The sound identification of refactoring opportunities is still an open problem in software engineering. Recent studies have shown the effectiveness of machine learning models in recommending methods that should undergo different refactoring operations. In this work, we experiment with such approaches to identify methods that should undergo an Extract Method refactoring, in the context of ING, a large financial organization. More specifically, we (i) compare the code metrics distributions, which are used as features by the models, between open-source and ING systems, (ii) measure the accuracy of different machine learning models in recommending Extract Method refactorings, (iii) compare the recommendations given by the models with the opinions of ING experts. Our results show that the feature distributions of ING systems and open-source systems are somewhat different, that machine learning models can recommend Extract Method refactorings with high accuracy, and that experts tend to agree with most of the recommendations of the model.

Files

3468264.3473927.pdf

(pdf | 0.621 Mb)