BIAS in Flemish automatic speech recognition

Conference Paper (2023)
Author(s)

Aaricia Herygers (Technische Hochschule Ingolstadt)

Vass Verkhodanova (Rijksuniversiteit Groningen)

Matt Coler (Rijksuniversiteit Groningen)

Odette Scharenborg (TU Delft - Multimedia Computing)

Munir Georges (Intel Corporation, Technische Hochschule Ingolstadt)

Multimedia Computing
Copyright
© 2023 Aaricia Herygers, Vass Verkhodanova, Matt Coler, O.E. Scharenborg, Munir Georges
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Aaricia Herygers, Vass Verkhodanova, Matt Coler, O.E. Scharenborg, Munir Georges
Multimedia Computing
ISBN (print)
978-3-95908-303-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Research has shown that automatic speech recognition (ASR) systems exhibit biases against different speaker groups, e.g., based on age or gender. This paper presents an investigation into bias in recent Flemish ASR. Seeing as Belgian Dutch, which is also known as Flemish, is often not included in Dutch ASR systems, a state-of-the-art ASR system for Dutch is trained using the Netherlandic Dutch data from the Spoken Dutch Corpus. Using the Flemish data from the JASMIN-CGN corpus, word error rates for various regional variants of Flemish are then compared. In addition, the most misrecognized phonemes are compared across speaker groups. The evaluation confirms a bias against speakers from West Flanders and Limburg, as well as against children, male speakers, and non-native speakers.

Files

ESSV_Herygers_Bias.pdf
(pdf | 0.959 Mb)
License info not available