BLEU it All Away!

Refocussing SE ML on the Homo Sapience

Abstract (2022)

Author(s)

L.H. Applis (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Software Engineering

To reference this document use

https://resolver.tudelft.nl/uuid:ad5f84b9-880c-48f0-a9a7-586634b47374

Publication Year

2022

Language

English

Event

International Summer School on Search- and Machine Learning-based Software Engineering (2022-06-22 - 2022-06-24), Cordoba, Spain

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Many tasks in machine learning for software engineering rely on prominent NLP metrics, such as the BLEU or ROUGE score. These metrics are themselves under heavy criticism within the NLP community, but the SE community adopted them for lack of better alternatives. In this paper, we summarize some of the problems with common metrics using code examples and look for alternatives. We argue that our only hope is the worst of all possible options: humans.

Files

Paper.pdf

(pdf | 0.281 Mb)