Assessing the Suitability of Panako for Music Identification in Movies

Bachelor Thesis (2021)
Author(s)

R.K. Nair (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Cynthia Liem – Mentor (TU Delft - Multimedia Computing)

Jeahun Kim – Mentor (TU Delft - Multimedia Computing)

J.H. Krijthe – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2021 Ruben Nair
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Ruben Nair
Graduation Date
02-07-2021
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Audio fingerprinting is a technique that allows for fast identification of music. Research concerning this technique first emerged around the 2000s and has lead to several applications, like Shazam. More recently, developments in this area have slowed down, even though there are still new challenges emerging. This paper investigates one of these challenges, music identification of movies, in a systematic way using the open-source fingerprinting framework called Panako. First, clips containing music were extracted from movies and queried using the default settings for Panako. Then, movie soundtracks were modified by layering noise over them, or by time-stretching and pitch-shifting them, and the performance of Panako on these modified audio signals was evaluated using a benchmark. Finally, the best configurations from the previous step were again queried using actual movie clips. These tests showed that both the default configuration and the configurations that performed best on the synthesised data perform poorly in movie music identification. Less than 10\% of the clips were identified correctly. The limited scope of this research, combined with the results gathered, show that there should be further investigation into the suitability of Panako for movie music identification.

Files

License info not available