Z. Yue | TU Delft Repository

Improving child speech recognition with augmented child-like speech

Conference paper (2024) - Y. Zhang (author) , Zhengjun Yue (author) , T.B. Patel (author) , Odette Scharenborg (author)

State-of-the-art ASRs show suboptimal performance for child speech. The scarcity of child speech limits the development of child speech recognition (CSR). Therefore, we studied child-to-child voice conversion (VC) from existing child speakers in the dataset and additional (new) c ...

Dysarthric Speech Recognition, Detection and Classification using Raw Phase and Magnitude Spectra

Journal article (2023) - Z. Yue (author) , Erfan Loweimi (author) , Zoran Cvetkovic (author)

In this paper, we explore the effectiveness of deploying the raw phase and magnitude spectra for dysarthric speech recognition, detection and classification. In particular, we scrutinise the usefulness of various raw phase-based representations along with their combinations with ...

Exploring Data Augmentation in Bias Mitigation Against Non-Native-Accented Speech

Conference paper (2023) - Y. Zhang (author) , Aaricia Herygers (author) , T.B. Patel (author) , Z. Yue (author) , O.E. Scharenborg (author)

Automatic speech recognition (ASR) should serve every speaker, not only the majority “standard” speakers of a language. In order to build inclusive ASR, mitigating the bias against speaker groups who speak in a “non-standard” or “diverse” way is crucial. We aim to mitigate the bi ...