Laughter detection in privacy-sensitive audio

None, None

Laughter detection in privacy-sensitive audio

Bachelor Thesis (2022)

Author(s)

M. Fregonara (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

H.S. Hung – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.D. Vargas Quiros – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.A. Baaijens – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use

https://resolver.tudelft.nl/uuid:ac1aca59-1812-4d3f-99a5-9441f61a6d9a

More Info

expand_more

Publication Year

2022

Language

English

Graduation Date

24-06-2022

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

302

Collections

thesis

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

With the development of new technologies and approaches in the field of social signal processing, concerns regarding privacy around recording conversations have arised. One of the main ways to preserve the privacy of the speakers in recorded conversations consists of decimating said conversations, which consists of reducing the sample frequency and the frequency bandwidth of the audio. This theoretically makes the verbal content of the conversation (the words themselves) unintelligible, while still preserving other useful non-verbal social cues such as laughter, pitch modulation and frequency of speech, amongst others. However, this has not been experimentally verified. This research paper addresses this knowledge gap by exploring the performance of laughter detection machine learning models with decimated audio. An existing pre-trained state-of-the-art laughter detection model was employed and its performance was evaluated for a dataset of decimated audio with sample frequencies ranging from 300Hz to 44100Hz.

Files

Research_paper_5_.pdf

(pdf | 0.734 Mb)

License info not available