Print Email Facebook Twitter Impact of audio codec and quality on genre classificaton and BPM recognition in Essentia Title Impact of audio codec and quality on genre classificaton and BPM recognition in Essentia Author Hulleman, Sjoerd (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Liem, C.C.S. (mentor) Kim, Jaehun (mentor) Tielman, M.L. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2022-01-28 Abstract Music Information Retrieval (MIR) is a field of research that focusses on extracting information from music related data. This includes the genre of music and the beats per minute (BPM) of a song. Pipelines that extract this information from music are called feature extractors. Essentia is a library for such feature extraction. Often, the audio codec and quality is not considered in research setups within the field of MIR, while this could have an influence on the results. Therefore the main research question is "How do different audio codecs and audio quality impact genre classification and beats per minute (BPM) recognition in Essentia?". To answer this, the genre has been narrowed down to rock and the chosen audio codecs are FLAC, MP3 LAME and OGG Voribs. In collaboration with Muziekweb, a Dutch music library that collects all music that has been released in The Netherlands, it was possible to gather music files in lossless format. To degrade the audio quality, classify songs and recognize BPM, python pipelines for codec conversion, rock genre classification and BPM recognition were created an ran on this data. It has been concluded that changes in audio codec and quality have an influence on genre classification and BPM recognition in Essentia. It has not been concluded which codec and quality is best to use in the field of MIR. Further research is needed to answer this. Subject MIRMusic information retrievalEssentiaGenre classificationGenreBPM recognitionBPMMusicmultimediamultimedia computingAudio codecscodecMP3OGGFLAC To reference this document use: http://resolver.tudelft.nl/uuid:da4d931a-5ae9-44b4-ba6c-218ebba0300b Bibliographical note https://gitlab.ewi.tudelft.nl/cse3k-21q2-music-faithfulness/project-sjoerd-hulleman GitLab repository containing all code used for this research. Part of collection Student theses Document type bachelor thesis Rights © 2022 Sjoerd Hulleman Files PDF Impact_of_audio_codec_and ... lleman.pdf 674.92 KB Close viewer /islandora/object/uuid:da4d931a-5ae9-44b4-ba6c-218ebba0300b/datastream/OBJ/view