Applying Large-Scale Weakly Supervised Automatic Speech Recognition to Air Traffic Control

Master Thesis (2023)

Author(s)

J.L.P.M. van Doorn (TU Delft - Aerospace Engineering)

Contributor(s)

Junzi Sun – Mentor (TU Delft - Control & Simulation)

J.M. Hoekstra – Graduation committee member (TU Delft - Control & Simulation)

Patrick Jonk – Mentor (Royal Netherlands Aerospace Centre NLR)

Vincent de Vries – Graduation committee member (Royal Netherlands Aerospace Centre NLR)

Faculty

Aerospace Engineering

Keywords

ASR, Air Traffic Control, Automatic Speech Recognition, ATC, Whisper

To reference this document use:

https://resolver.tudelft.nl/uuid:8aa780bf-47b6-4f81-b112-29e23bc06a7d

Publication Year

2023

Language

English

Graduation Date

11-12-2023

Awarding Institution

Delft University of Technology

Programme

Aerospace Engineering

Abstract

Automatic speech recognition has been researched extensively for the air traffic control domain, yet its primary application remains the training and simulation of air traffic controllers. The reason is that its performance is still insufficient for specialized environments such as air traffic control, where performance and safety requirements are stringent. This study demonstrates how Whisper, a large-scale, weakly supervised automatic speech recognition model, could meet these requirements and establish a new approach to air traffic control communication. Fine-tuning Whisper for the air traffic control domain resulted in a word error rate of 13.5% on the ATCO2 dataset and 1.17% on the ATCOSIM dataset. Furthermore, the study reveals that fine-tuning with region-specific data can enhance performance by up to 60% in real-world scenarios.
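For readers unfamiliar with the metric, the word error rate cited in the abstract is conventionally defined as the number of word-level substitutions, deletions, and insertions needed to turn the recognized transcript into the reference transcript, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not the evaluation code used in the thesis; the function name `word_error_rate`, the whitespace tokenization, and the example transcripts are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative sketch only: tokenization is a plain whitespace split,
    which is an assumption and may differ from the thesis's text
    normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcripts: one substituted word out of six.
wer = word_error_rate(
    "climb flight level three two zero",
    "climb flight level three zero zero",
)
print(f"WER: {wer:.1%}")
```

Under this definition, a word error rate of 1.17% corresponds to roughly one word-level error per 85 reference words.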

Files

MSc_Thesis_Report_Jan_van_Door... (pdf)

(pdf | 1.46 Mb)

License info not available