Gen-AI Meets Domain Expertise: LLMs for Domain-Specific Code Generation
A study conducted at the ASML leveling department
Y. Mundhra (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Maliheh Izadi – Mentor (TU Delft - Software Engineering)
F.A. Kuipers – Mentor (TU Delft - Networked Systems)
Max Valk – Mentor (ASML)
Lewis Binns – Mentor (ASML)
U.K. Gadiraju – Graduation committee member (TU Delft - Web Information Systems)
Goran Brkic – Mentor (ASML)
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Large Language Models (LLMs) have shown impressive performance in various domains, including software engineering. Code generation, a crucial aspect of software development, has seen significant improvements with the integration of AI tools. While existing LLMs have shown strong performance in generating code for everyday tasks, their application in industrial settings and domain-specific contexts remains largely unexplored. This thesis investigates the potential of LLMs to generate code in proprietary, domain-specific environments, with a specific focus on the leveling department at ASML. The primary goal of this research is to assess the ability of LLMs to adapt to a domain they have not encountered before and to generate complex, interdependent code in a domain-specific repository. This involves evaluating the performance of LLMs in generating code that meets the specific requirements of ASML. To achieve this, the thesis investigates various prompting techniques, compares the performance of generic and code-specific LLMs, and examines the impact of model size on code generation capabilities. To evaluate the code generation capabilities of LLMs in repository-level scenarios, we introduce a new performance metric, build@k, designed to measure how often generated code compiles and builds within the project. The results showed that both prompting techniques and model size have a substantial influence on the code generation capabilities of LLMs. However, the performance difference between code-specific and generic LLMs was less pronounced and varied substantially across different model families.
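The abstract introduces build@k only at a high level. As an illustration, a minimal sketch is given below, assuming build@k mirrors the standard unbiased pass@k estimator of Chen et al. (2021), with "builds successfully" substituted for "passes the tests"; the function name and arguments are hypothetical and not taken from the thesis.

    from math import comb
    from typing import Sequence

    def build_at_k(num_samples: Sequence[int], num_building: Sequence[int], k: int) -> float:
        """Hypothetical build@k estimator (assumed analogue of unbiased pass@k).

        num_samples[i]  -- completions generated for task i (n)
        num_building[i] -- completions for task i that compile and build (b)
        """
        scores = []
        for n, b in zip(num_samples, num_building):
            if n - b < k:
                # Fewer than k non-building samples: any k drawn completions
                # must include at least one that builds.
                scores.append(1.0)
            else:
                # 1 minus the probability that none of the k drawn completions build.
                scores.append(1.0 - comb(n - b, k) / comb(n, k))
        return sum(scores) / len(scores)

Under this assumed definition, build@k averages over tasks the probability that at least one of k sampled completions builds, estimated without bias from n >= k samples per task.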