Intelligibility Enhancement Based on Mutual Information

Journal Article (2017)
Author(s)

S Khademi (TU Delft - Pattern Recognition and Bioinformatics)

Richard Hendriks (TU Delft - Signal Processing Systems)

W.B. Kleijn (TU Delft - Signal Processing Systems)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1109/TASLP.2017.2714424
More Info
expand_more
Publication Year
2017
Language
English
Research Group
Pattern Recognition and Bioinformatics
Issue number
8
Volume number
25
Pages (from-to)
1694-1708

Abstract

Speech intelligibility enhancement is considered for multiple-microphone acquisition and single loudspeaker rendering. This is based on the mutual information measured between the message spoken at far-end environment and the message perceived by a listener at near-end. We prove that the joint optimal processing can be decomposed into far-end and near-end processing. The former is a minimum variance distortionless response beamformer that reduces the noise in the talker environment and the latter is a post-filter that redistributes the power over the frequency bands. Disjoint processing is optimal provided that the post-filtering operation is aware of the residual noise from the beamforming operation. Our results show that both processing steps are necessary for the effective conveyance of a message and, importantly, that the second step must be aware of the remaining noise from the beamforming operation in the first step. In addition, we study the use of the mutual information applied on the perceptually more relevant powers per critical band.

No files available

Metadata only record. There are no files for this record.