How Reduction in Sample Frequency Hinders the Detection of Words

Bachelor thesis (2022)

Authors

L. Alonso Arenaza Electrical Engineering, Mathematics and Computer Science

Contributors

H.S. Hung Pattern Recognition and Bioinformatics - (mentor)

J.D. Vargas Quiros Pattern Recognition and Bioinformatics - (mentor)

C.A. Raman Pattern Recognition and Bioinformatics - (mentor)

J.A. Baaijens Pattern Recognition and Bioinformatics - (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:15e23376-b68c-4029-869f-573efe4e92fc

More Info

expand_more

Published Date

24-06-2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Living in a world where every single electronic device is online and interconnected, privacy is a growing concern. Finding the threshold where audio is unintelligible to transcription software is crucial when everything that we say can be recorded. Even if Automated Speech Recognition (ASR) is used in tools, such as Siri or Alexa, designed to ease daily tasks, it could also be used in malicious manners. ASR technology has not been around for too long and like any other new piece of technology, it still has many aspects that have not been looked into and are unknown to the public. This research paper addresses this knowledge gap by examining how sample frequency reduction affects word detection using current well-known transcription software technology such as Google’s speech recognition software and Kaldi’s toolkit. The behavior and performance of these two software pieces have been analyzed for different sample frequencies in the range from 300Hz to 44,1kHz.

Files

Low_Frequency_Detection_Words_... (.pdf)

(.pdf | 2.45 Mb)