BIAS in Flemish automatic speech recognition

Conference Paper (2023)
Author(s)

Aaricia Herygers (Technische Hochschule Ingolstadt)

Vass Verkhodanova (Rijksuniversiteit Groningen)

Matt Coler (Rijksuniversiteit Groningen)

O.E. Scharenborg (TU Delft - Multimedia Computing)

Munir Georges (Intel Corporation, Technische Hochschule Ingolstadt)

Research Group
Multimedia Computing
More Info
expand_more
Publication Year
2023
Language
English
Research Group
Multimedia Computing
ISBN (print)
978-3-95908-303-4
Event
ESSV Konferenz Elektronische Sprachsignalverarbeitung (2023-03-01 - 2023-03-03), Munich, Germany
Downloads counter
287
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Research has shown that automatic speech recognition (ASR) systems exhibit biases against different speaker groups, e.g., based on age or gender. This paper presents an investigation into bias in recent Flemish ASR. Seeing as Belgian Dutch, which is also known as Flemish, is often not included in Dutch ASR systems, a state-of-the-art ASR system for Dutch is trained using the Netherlandic Dutch data from the Spoken Dutch Corpus. Using the Flemish data from the JASMIN-CGN corpus, word error rates for various regional variants of Flemish are then compared. In addition, the most misrecognized phonemes are compared across speaker groups. The evaluation confirms a bias against speakers from West Flanders and Limburg, as well as against children, male speakers, and non-native speakers.

Files

ESSV_Herygers_Bias.pdf
(pdf | 0.959 Mb)
License info not available