Acoustic Reflectors Localization from Stereo Recordings Using Neural Networks

Conference Paper (2021)
Author(s)

Giovanni Bologni (Student TU Delft)

R. Heusdens (TU Delft - Signal Processing Systems)

Jorge Martinez (TU Delft - Multimedia Computing)

Research Group
Signal Processing Systems
DOI related publication
https://doi.org/10.1109/ICASSP39728.2021.9414473
More Info
expand_more
Publication Year
2021
Language
English
Research Group
Signal Processing Systems
Pages (from-to)
461-465
ISBN (print)
978-1-7281-7606-2
ISBN (electronic)
978-1-7281-7605-5

Abstract

Acoustic room geometry estimation is often performed in ad hoc settings, i.e., using multiple microphones and sources distributed around the room, or assuming control over the excitation signals. We propose a fully convolutional network (FCN) that localizes reflective surfaces under the relaxed assumptions that (i) a compact array of only two microphones is available, (ii) emitter and receivers are not synchronized, and (iii) both the excitation signals and the impulse responses of the enclosures are unknown. Our FCN is trained in a supervised fashion to predict the likelihood of reflective surfaces at specific distances and directions-of-arrival (DOA). When a single reflective surface is present, up to 80% of real and virtual sources are detected, while this figure approaches 50% in rectangular rooms. Experiments on real-world recordings report similar accuracy as with artificially reverberated speech signals, validating the generalization capabilities of the framework.

No files available

Metadata only record. There are no files for this record.