An Empirical Analysis on the Performance of UniXcoder

Bachelor thesis (2022)

Authors

T.O. van Dam Electrical Engineering, Mathematics and Computer Science

Contributors

M. Izadi Software Engineering - (supervisor 1)

A. van Deursen Software Technology (supervisor 1)

A. Lukina Algorithmics - (supervisor 2)

Faculty

Electrical Engineering, Mathematics and Computer Science

Code completion Type annotations

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:ccf8b865-931c-480b-a5dd-4e4120a20126

Published Date

24-06-2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Numerous papers have empirically studied the performance of deep learning based code completion models. However, none of these papers considered nor investigated whether good performance on statically typed languages translates to good performance on dynamically typed languages. A lack of available type information can make code completion more difficult, as many types are interacted with differently. However, natural language in the form of comments could compensate for a lack of available type information. This paper evaluates whether UniXcoder, a state of the NLP model, is able to perform code completion on both dynamically and statically typed languages with similar performance. Furthermore, the impact of the presence of type annotations and comments is assessed. We show that UniXcoder is able to utilize type annotations and comments in order to improve code completion performance, and that using only singleline comments yields better results than using all comments in the source code.

Files

An_Empirical_Analysis_on_the_P... (.pdf)

(.pdf | 0.351 Mb)