Data Hound: Linking Educational Value to LLM Code Completion Performance During Inference

None, None

Data Hound: Linking Educational Value to LLM Code Completion Performance During Inference

Bachelor Thesis (2025)

Author(s)

B.R.M. Annink (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

A. van Deursen – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

M. Izadi – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.B. Katzy – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

R.M. Popescu – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

A. Anand – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

Large Language Models Performance Educational Value Data Smells Code Completion

To reference this document use

https://resolver.tudelft.nl/uuid:6ba0830d-e846-43b5-95a5-743b3345e7a4

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

01-07-2025

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

125

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper investigates the relation between the educational value of input code and the subsequent inference performance of code large language models (LLMs) on completion tasks. Results were attained using The Heap dataset and using SmolLM2, StarCoder 2 and Mellum models. Performance was measured by comparing the generated outputs with the ground truth, where high similarity indicates high performance. We analyse how factors such as language, model size, task type and granularity of educational value affect performance across educational value. We find that most factors do not have a relation with education value, as most metrics plateau except for exact-match. It is observed to have a consistent negative correlation with educational value. Additionally, a consistent turning point is seen around an educational value of 1.75, before which, performance tends to have a more positive relation with educational value. Results highlight the influence of input quality on LLM behaviour and offer insights for more effective training and evaluation strategies.

Files

Datahound_eduv_final_4.pdf

(pdf | 1.09 Mb)

License info not available