A Study on Reference Microphone Selection for Multi-Microphone Speech Enhancement

Journal Article (2020)
Author(s)

Jie Zhang (University of Science and Technology of China (USTC), Hefei)

Huawei Chen (Nanjing University of Aeronautics and Astronautics)

Richard C. Hendriks (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Signal Processing Systems
DOI related publication
https://doi.org/10.1109/TASLP.2020.3039930 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Research Group
Signal Processing Systems
Journal title
IEEE/ACM Transactions on Audio Speech and Language Processing
Volume number
29 (2021)
Article number
9272831
Pages (from-to)
671-683
Downloads counter
365
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Multi-microphone speech enhancement methods typically require a reference position with respect to which the target signal is estimated. Often, this reference position is arbitrarily chosen as one of the reference microphones. However, it has been shown that the choice of the reference microphone can have a significant impact on the final noise reduction performance. In this paper, we therefore theoretically analyze the impact of selecting a reference on the noise reduction performance with near-end noise being taken into account. Following the generalized eigenvalue decomposition (GEVD) based optimal variable span filtering framework, we find that for any linear beamformer, the output signal-to-noise ratio (SNR) taking both the near-end and far-end noise into account is reference dependent. Only when the near-end noise is neglected, the output SNR of rank-1 beamformers does not depend on the reference position. However, in general for rank-r beamformers with r>1 (e.g., the multichannel Wiener filter) the performance does depend on the reference position. Based on these, we propose an optimal algorithm for microphone reference selection that maximizes the output SNR. In addition, we propose a lower-complexity algorithm that is still optimal for rank-1 beamformers, but sub-optimal for the general rank-r beamformers. Experiments using a simulated microphone array validate the effectiveness of both proposed methods and show that in terms of quality, several dB can be gained by selecting the proper reference microphone.

Files

09272831.pdf
(pdf | 2.27 Mb)
License info not available