AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
This paper proposes AS-ASR, a lightweight aphasiaspecific speech recognition framework based on Whisper-tiny, tailored for low-resource deployment on edge devices. Our approach introduces a hybrid training strategy that systematically combines standard and aphasic speech at varying ratios, enabling robust generalization, and a GPT-4-based reference enhancement method that refines noisy aphasic transcripts, improving supervision quality. We conduct extensive experiments across multiple data mixing configurations and evaluation settings. Results show that our fine-tuned model significantly outperforms the zero-shot baseline, reducing WER on aphasic speech by over $30 \%$ while preserving performance on standard speech. The proposed framework offers a scalable, efficient solution for realworld disordered speech recognition.
Files
File under embargo until 14-07-2026