THE ACOUSTICBRAINZ GENRE DATASET

MULTI-SOURCE, MULTI-LEVEL, MULTI-LABEL, AND LARGE-SCALE

Conference Paper (2019)
Author(s)

Dmitry Bogdanov (Pompeu Fabra University)

Alastair Porter (Pompeu Fabra University)

Hendrik Schreiber (Tagtraum Industries Incorporated)

Julián Urbano (TU Delft - Multimedia Computing)

Sergio Oramas (Pandora)

Copyright
© 2019 Dmitry Bogdanov, Alastair Porter, Hendrik Schreiber, Julián Urbano, Sergio Oramas
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 Dmitry Bogdanov, Alastair Porter, Hendrik Schreiber, Julián Urbano, Sergio Oramas
Pages (from-to)
360-367
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper introduces the AcousticBrainz Genre Dataset, a large-scale collection of hierarchical multi-label genre annotations from different metadata sources. It allows researchers to explore how the same music pieces are annotated differently by different communities following their
own genre taxonomies, and how this could be addressed
by genre recognition systems. Genre labels for the dataset
are sourced from both expert annotations and crowds, permitting comparisons between strict hierarchies and folksonomies. Music features are available via the AcousticBrainz database. To guide research, we suggest a concrete research task and provide a baseline as well as an
evaluation method. This task may serve as an example
of the development and validation of automatic annotation algorithms on complementary datasets with different
taxonomies and coverage. With this dataset, we hope to
contribute to developments in content-based music genre
recognition as well as cross-disciplinary studies on genre
metadata analysis