AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition

None, None; None, None; None, None

AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition

Conference Paper (2025)

Author(s)

C. Bao (Student TU Delft)

Chuanbing Huo (Sanford Health)

C. Gao (TU Delft - Electronics)

Research Group

Electronics

DOI related publication

https://doi.org/10.1109/BioCAS67066.2025.00027

Healthcare Fine-Tuning AIoT Whisper Aphasic Speech Recognition

To reference this document use:

https://resolver.tudelft.nl/uuid:c542463d-b897-4f8c-8883-32accde579dd

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Electronics

Pages (from-to)

76-80

Publisher

IEEE

ISBN (print)

979-8-3315-7337-9

ISBN (electronic)

979-8-3315-7336-2

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper proposes AS-ASR, a lightweight aphasiaspecific speech recognition framework based on Whisper-tiny, tailored for low-resource deployment on edge devices. Our approach introduces a hybrid training strategy that systematically combines standard and aphasic speech at varying ratios, enabling robust generalization, and a GPT-4-based reference enhancement method that refines noisy aphasic transcripts, improving supervision quality. We conduct extensive experiments across multiple data mixing configurations and evaluation settings. Results show that our fine-tuned model significantly outperforms the zero-shot baseline, reducing WER on aphasic speech by over $30 \%$ while preserving performance on standard speech. The proposed framework offers a scalable, efficient solution for realworld disordered speech recognition.

Files

AS-ASR_A_Lightweight_Framework... (pdf)

(pdf | 0.518 Mb)

Taverne

File under embargo until 14-07-2026