Programming Language Models in Multilingual Settings
J.B. Katzy (TU Delft - Software Engineering)
Abstract
Large language models are increasingly used in programming contexts. Because this trend is recent, however, some aspects remain underexplored. We propose a research approach that investigates the inner mechanics of transformer networks, at the neuron, layer, and output-representation levels, to determine whether a theoretical limitation prevents large language models from performing optimally in a multilingual setting. We propose to approach this investigation by addressing open problems in machine learning for the software engineering community. This will contribute to a greater understanding of large language models for programming-related tasks, make the findings more approachable to practitioners, and simplify their implementation in future models.
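
To make the proposed neuron- and layer-level inspection concrete, below is a minimal sketch of what such an analysis could look like. It is not the author's method: it assumes the Hugging Face `transformers` library and uses GPT-2 as a stand-in model (a code-trained model would be used in practice), and the two snippets are illustrative placeholders for "the same program in two languages".

```python
# Minimal sketch: compare per-layer, per-neuron activations of a transformer
# on semantically equivalent snippets written in two programming languages.
# Assumptions: Hugging Face `transformers` is installed; "gpt2" is a
# placeholder model, not the one studied in the thesis.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; swap in a code-trained model in practice
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

# Illustrative stand-ins for "the same function in two languages".
snippets = {
    "python": "def add(a, b):\n    return a + b",
    "java": "int add(int a, int b) { return a + b; }",
}

with torch.no_grad():
    for lang, code in snippets.items():
        inputs = tokenizer(code, return_tensors="pt")
        out = model(**inputs, output_hidden_states=True)
        # out.hidden_states is a tuple of (num_layers + 1) tensors, one per
        # layer (plus the embedding layer), each (batch, seq_len, hidden_size).
        for layer_idx, hidden in enumerate(out.hidden_states):
            # Mean activation of each neuron over the token sequence gives a
            # crude per-layer, per-neuron signature for this language.
            neuron_means = hidden[0].mean(dim=0)  # shape: (hidden_size,)
            print(lang, layer_idx, neuron_means[:5].tolist())
```

Comparing such per-layer signatures across languages is one simple way to probe whether the network allocates the same neurons to equivalent programs, which is the kind of mechanistic question the proposed research targets.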