Edge AI for Urban Noise Monitoring: Perceptual Soundscape Prediction on Low-Cost Sensors

Master Thesis (2025)
Author(s)

P. Herfkens (TU Delft - Technology, Policy and Management)

Contributor(s)

S Van Cranenburgh – Graduation committee member (TU Delft - Transport and Logistics)

Sepinoud Azimi – Graduation committee member (TU Delft - Information and Communication Technology)

Lion Cassens – Mentor (TU Delft - Transport and Logistics)

Faculty
Technology, Policy and Management
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
14-07-2025
Awarding Institution
Delft University of Technology
Programme
['Engineering and Policy Analysis']
Faculty
Technology, Policy and Management
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This thesis explores the development of lightweight neural network models for predicting perceptual soundscape attributes and environmental sound sources, with the goal of enabling real-time, sensor-based soundscape monitoring in urban environments. Recognizing the limitations of conventional noise metrics, the study adopts a perceptual approach that accounts for how individuals experience urban soundscapes. The work leverages the Affective Responses to Augmented Urban Soundscapes (ARAUS) dataset to replicate and extend the training pipeline of the baseline AD_CNN model. Four parameter-reduction strategies are implemented - shrinking convolutional and dense layers, reducing temporal resolution, and increasing pooling - to produce compact models suitable for low-cost, resource-constrained sensors. Two optimized variants, AD_CNN_dense_layer and AD_CNN_hop_length, are combined into an ensemble model with only ~160K parameters, which outperforms the original AD_CNN in perceptual attribute prediction. A generalisation study evaluates model robustness on a separate urban parks dataset, revealing that while the lightweight ensemble performs well overall, it struggles with certain attributes such as pleasantness. Larger models like SoundAQnet generalise better but remain imperfect. The findings demonstrate the technical feasibility of lightweight perceptual models but also highlight the need for localised fine-tuning to account for subjective and context-dependent factors shaping soundscape perception. This work contributes to scalable, context-aware soundscape monitoring solutions for healthier urban environments.

Files

Thesis_Pepijn_Herfkens.pdf
(pdf | 3.62 Mb)
License info not available