Z. Yue | TU Delft Repository

End-to-end acoustic-articulatory dysarthric speech recognition leveraging large-scale pretrained acoustic features

Conference paper (2025) - Z. Yue (author) , Y. Zhang (author)

Automatic dysarthric speech recognition (ADSR) remains challenging due to the irregularities in speech caused by motor control impairments and the limited availability of dysarthric speech data. This paper explores the integration of articulatory features, captured using Electrom ...

Improving child speech recognition with augmented child-like speech

Conference paper (2024) - Y. Zhang (author) , Z. Yue (author) , T.B. Patel (author) , Odette Scharenborg (author)

State-of-the-art ASRs show suboptimal performance for child speech. The scarcity of child speech limits the development of child speech recognition (CSR). Therefore, we studied child-to-child voice conversion (VC) from existing child speakers in the dataset and additional (new) c ...

Exploring Data Augmentation in Bias Mitigation Against Non-Native-Accented Speech

Conference paper (2023) - Y. Zhang (author) , Aaricia Herygers (author) , T.B. Patel (author) , Z. Yue (author) , Odette Scharenborg (author)

Automatic speech recognition (ASR) should serve every speaker, not only the majority “standard” speakers of a language. In order to build inclusive ASR, mitigating the bias against speaker groups who speak in a “non-standard” or “diverse” way is crucial. We aim to mitigate the bi ...

Dysarthric Speech Recognition, Detection and Classification using Raw Phase and Magnitude Spectra

Journal article (2023) - Zhengjun Yue (author) , Erfan Loweimi (author) , Zoran Cvetkovic (author)

In this paper, we explore the effectiveness of deploying the raw phase and magnitude spectra for dysarthric speech recognition, detection and classification. In particular, we scrutinise the usefulness of various raw phase-based representations along with their combinations with ...