On speech enhancement in very low SNRs for smart speakers

None, None

On speech enhancement in very low SNRs for smart speakers

Master Thesis (2018)

Author(s)

K.A. Sachos (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R. Heusdens – Mentor

Martin Bo Møller – Mentor

Pablo Martinez-Nuevo – Mentor

Jesper Kjaer Nielsen – Mentor

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Adaptive Filtering Speech enhancement Smart speakers Speech separation

To reference this document use:

https://resolver.tudelft.nl/uuid:67e1ea69-6d46-4d0c-9840-a27c0b126854

More Info

expand_more

Publication Year

2018

Language

English

Copyright

Graduation Date

19-10-2018

Awarding Institution

Delft University of Technology

Programme

['Electrical Engineering']

Abstract

Human interaction with a smart speaker involves often distant automatic speech recognition (ASR). However, ASR is a rather cumbersome task at significantly high levels of noise. Most of commercial smart speakers in order to achieve high ASR accuracy they tend to reduce the playback signal once the preset keyword is detected. In an effort to dispose this function from the smart speaker, in this thesis a speech enhancement technique is considered in the front-end of the ASR system aiming at the suppression of the dominant noise component in the degraded speech signal. Having a priori knowledge on the playback signal renders adaptive filtering a well-suited speech technique. Therefore, the class of least mean squares (LMS) algorithms is studied and assessed. Among other techniques of this class the transform domain LMS (TDLMS), due to its inherent signal decorrelation properties, is shown to achieve the best performance in terms of noise suppression and improved speech intelligibility as well as word error rate. The results of this study correspond to a set of simulation incorporating real impulse responses measured in both an anechoic and a reverberant environment.

Files

ThesisReport_K.Sachos_final_.p... (pdf)

(pdf | 1.52 Mb)

License info not available