Urban sound classification on the edge: exploring the accuracy-efficiency trade-off

None, None; None, None; None, None; None, None

Urban sound classification on the edge: exploring the accuracy-efficiency trade-off

Conference Paper (2025)

Author(s)

L. Cassens (TU Delft - Transport and Logistics)

M. Kroesen (TU Delft - Transport and Logistics)

S.C. Calvert (TU Delft - Traffic Systems Engineering)

S. van Cranenburgh (TU Delft - Transport and Logistics)

Research Group

Operations & Environment

DOI related publication

https://doi.org/10.61782/fa.2025.0940

Convolutional neural network Edge AI Environmental sound classification

To reference this document use:

https://resolver.tudelft.nl/uuid:f8893acf-e1fd-49ad-96cd-09b4b232af6c

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Operations & Environment

Pages (from-to)

2279-2283

Publisher

European Acoustics Association, EAA

ISBN (print)

978-84-87985-35-5

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Sound source classification is a valuable addition to noise monitoring, providing ‘further insights into local soundscapes. For privacy preservation, this classification often must be conducted on the edge, i.e., in real time on noise sensors. This puts constraints on the size and complexity of the classification models that can be used. Furthermore, there is a trade-off between accuracy and efficiency, which needs to be balanced on battery or solar powered sensors. However, little is known about this trade-off under consideration of constraints imposed by such sensors. In this paper, we explore the scope of sound classification models that can run efficiently on low-cost sound sensors. Specifically, we investigate the Pareto frontiers between model accuracy and computational complexity, providing insights into the trade-off necessary for deploying such models on very constrained hardware. Building on these findings, we train new classification models optimized for edge devices. The models are trained on publicly available audio samples and a new Dutch Urban Sounds dataset specifically collected to enhance the accuracy of sound source classification in urban environments. The models and implementation are open source, enabling researchers and practitioners to adopt, adapt, and build upon our work.

Files

000940.pdf

(pdf | 0.403 Mb)