Dd
Dimme de Groot
11 records found
1
Auditory Kernels for Representing Degraded Speech
Auditory Kernels in an Efficient Representation of Degraded Speech
We explore the use of biologically inspired auditory kernels—learned from sparse coding on (clean) read speech—to analyze and reconstruct signals degraded with additive noise. Auditory kernels mimic spectrotemporal filters in the human auditory system, offering insight into how s
...
Effcient Auditory Coding for Bat Vocalizations
Testing Auditory Kernel Effciency on Rhinolophus Affnis Calls
Abstract—Effcient neural coding is a theoretical model in sensory neuroscience, positing that biological systems maximize information transfer to the brain while minimizing neural resources. While this concept has been extensively studied in the context of human speech perception
...
Assessing speech intelligibility for individuals with dysarthria is critical for understanding the severity of motor speech disorders and evaluating speech therapy interventions. Traditional subjective as- sessments, while effective, are resource-intensive and prone to bias, whic
...
Clear communication in public address systems is essential, especially in environments where safety or information clarity is critical. Speech intelligibility is often assessed using objective intelligibility metrics (OIMs), which predict intelligibility through mathematical mode
...
Objective speech intelligibility metrics (OIMs) are widely used in various fields, including public ser- vice announcements. These metrics do not directly predict the intelligibility of a speech (defined as the ratio of understandable words in an audio sample), but produce values
...
This study investigates the performance of two objective speech quality metrics, Perceptual Evaluation of Speech Quality (PESQ) and Virtual Speech Quality Objective Listener (ViSQOL), in predicting human-rated speech quality scores, which are essential for telecommunication syste
...
Reverberation is a key aspect when designing the interior of buildings, and must be carefully considered in the context of the function of the room. Defined by the reverberation time (RT), it is known to have a big influence on the intelligibility and quality of audio in closed s
...
Evaluation of Perceptual Accuracy in Simulated Room Impulse Responses
Designing and Implementing a Subjective Testing Methodology for the Perceptual Evaluation of Simulated Room Impulse Responses
The accurate simulation of Room Impulse Responses (RIRs) is important in a variety of applications in acoustics such as automatic speech recognition, speech enhancement, and architectural acoustic design. While objective metrics for evaluating RIRs have been researched extensivel
...
In building design, it is important to consider certain materials for certain acoustical properties. Specifically, the time it takes for an audio signal to decrease in volume by 60 dB is important. This can be estimated with Sabine's and Eyring's formula's, which both make use of
...
Estimating reverberation time (RT60) accurately is crucial for enhancing the acoustic quality of various environments as it decides how you feel the sound fades away subjectively. Traditional methods, such as Sabine's equation, require extensive prior knowledge and assume ideal c
...
A Room Impulse Response (RIR) is a mathematical model for sound propagation in a room. Estimating RIR parameters such as the reverberation time (T60) allows Automatic Speech Recognition (ASR) systems to adapt to reverberation in input signals by changing their behavior based on t
...