A Study on Reference Microphone Selection for Multi-Microphone Speech Enhancement

Journal article (2020)

Authors

Jie Zhang University of Science and Technology of China (USTC), Hefei

Huawei Chen Nanjing University of Aeronautics and Astronautics

R.C. Hendriks Signal Processing Systems -

Research Group

Signal Processing Systems () (TU Delft)

DOI

https://doi.org/10.1109/TASLP.2020.3039930

Array signal processing Microphone arrays Speech enhancement Relative acoustic transfer function Microphones Noise reduction Signal to noise ratio Low-rank approximation Acoustic distortion Multi-channel beamforming Reference microphone Variable span linear filters

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:da333e94-0495-42eb-b3ee-3ea0ce8acbd8

Published Date

2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Microelectronics

Research Group

Signal Processing Systems

Abstract

Multi-microphone speech enhancement methods typically require a reference position with respect to which the target signal is estimated. Often, this reference position is arbitrarily chosen as one of the reference microphones. However, it has been shown that the choice of the reference microphone can have a significant impact on the final noise reduction performance. In this paper, we therefore theoretically analyze the impact of selecting a reference on the noise reduction performance with near-end noise being taken into account. Following the generalized eigenvalue decomposition (GEVD) based optimal variable span filtering framework, we find that for any linear beamformer, the output signal-to-noise ratio (SNR) taking both the near-end and far-end noise into account is reference dependent. Only when the near-end noise is neglected, the output SNR of rank-1 beamformers does not depend on the reference position. However, in general for rank-r beamformers with r>1 (e.g., the multichannel Wiener filter) the performance does depend on the reference position. Based on these, we propose an optimal algorithm for microphone reference selection that maximizes the output SNR. In addition, we propose a lower-complexity algorithm that is still optimal for rank-1 beamformers, but sub-optimal for the general rank-r beamformers. Experiments using a simulated microphone array validate the effectiveness of both proposed methods and show that in terms of quality, several dB can be gained by selecting the proper reference microphone.

Files

09272831.pdf

(.pdf | 2.27 Mb)

09272831.pdf

(.pdf | 3.89 Mb)