Detecting Socially Significant Music Events using Temporally Noisy Labels

Journal Article (2018)
Author(s)

NK Yadati (TU Delft - Multimedia Computing)

MA Larson (TU Delft - Multimedia Computing, Radboud Universiteit Nijmegen)

Cynthia C.S. Liem (TU Delft - Multimedia Computing)

A. Hanjalic (TU Delft - Intelligent Systems)

Multimedia Computing
Copyright
© 2018 N.K. Yadati, M.A. Larson, C.C.S. Liem, A. Hanjalic
DOI related publication
https://doi.org/10.1109/TMM.2018.2801719
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 N.K. Yadati, M.A. Larson, C.C.S. Liem, A. Hanjalic
Multimedia Computing
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Issue number
9
Volume number
20
Pages (from-to)
2526-2540
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this paper, we focus on event detection over the timeline of a music track. Such technology is motivated by the need for innovative applications such as searching, non-linearaccess and recommendation. Event detection over the timeline requires time-code level labels in order to train machine learning dels. We use timed comments from SoundCloud, a modern social music sharing platform, to obtain these labels. While in this way the need for tedious and time-consuming manual labeling can be reduced, the challenge is that timed comments are subject to additive temporal noise, as they are in the temporal neighborhood of the actual events. We investigate the utility of such noisy timed comments as training labels through a case study, in which we investigate three types of events in Electronic Dance Music (EDM): drop, build and break. These socially significant events play a key role in an EDM track's unfolding and are popular in social media circles. They are therefore not only interesting for detection, but also typically accompanied by timed comments resulting from the online social activity around them. We propose a two-stage learning method that relies on noisy timed comments and, given a music track, marks the events on the timeline. In the experiments, we focus in particular on investigating to which extent noisy timed comments can replace manually added expert labels. The conclusions we draw during this study provide useful insights that motivates further research in the field of event detection.

Files

08279544.pdf
(pdf | 1.38 Mb)
- Embargo expired in 28-03-2019
License info not available