Challenges and practical guidelines for atypical speech data collection, annotation, usage and sharing

None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None

Challenges and practical guidelines for atypical speech data collection, annotation, usage and sharing

A multi-project perspective

Conference Paper (2025)

Author(s)

Zhengjun Yue (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Mara Barberis (Katholieke Universiteit Leuven)

Tanvina Patel (TU Delft - Electrical Engineering, Mathematics and Computer Science, Erasmus MC)

Judith Dineley (King’s College London)

Willemijn Doedens (Koninklijke Auris Groep)

Lottie Stipdonk (Erasmus MC)

Yuanyuan Zhang (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Elke De Witte (Erasmus MC)

Odette Scharenborg (TU Delft - Electrical Engineering, Mathematics and Computer Science)

undefined More Authors

Research Group

Multimedia Computing

Automatic speech recognition Dutch atypical speech Speech annotation Speech data collection

DOI related publication

https://doi.org/10.21437/Interspeech.2025-2774 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:c8045e26-81fc-4a13-b751-16cb96437dd9

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Multimedia Computing

Pages (from-to)

3943-3947

Publisher

International Speech Communication Association

Event

26th Interspeech Conference 2025 (2025-08-17 - 2025-08-21), Rotterdam, Netherlands

Downloads counter

89

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Speech technologies have advanced significantly, yet they remain largely trained on typical speech, limiting their applicability to individuals with speech and language impairments. A key obstacle is the lack of well-annotated and representative atypical speech corpora. This paper conducts a multi-project survey and shares the first-hand experience on the challenges of collecting, annotating, using, and sharing atypical speech data. Experiences from seven research projects on collecting atypical speech data, involving both academic and clinical perspectives, are reported and potential issues are discussed. Furthermore, the paper provides practical guidelines that allow for standardisation and harmonisation of data collection practices, which are crucial to allow studies to be compared, replicated, and validated, which is essential for developing more inclusive and effective speech technologies.

Files

Yue25_interspeech.pdf

(pdf | 0.493 Mb)

License info not available