Transfer Learning of Artist Group Factors to Musical Genre Classification

Conference Paper (2018)
Author(s)

Jaehun Kim (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Minz Won (Pompeu Fabra University)

Xavier Serra (Pompeu Fabra University)

Cynthia C. S. Liem (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Multimedia Computing
DOI related publication
https://doi.org/10.1145/3184558.3191823 Final published version
More Info
expand_more
Publication Year
2018
Language
English
Research Group
Multimedia Computing
Pages (from-to)
1929-1934
ISBN (print)
978-1-4503-5640-4
Event
WWW 2018 (2018-04-23 - 2018-04-27), Lyon, France
Downloads counter
288
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The automated recognition of music genres from audio information is a challenging problem, as genre labels are subjective and noisy. Artist labels are less subjective and less noisy, while certain artists may relate more strongly to certain genres. At the same time, at prediction time, it is not guaranteed that artist labels are available for a given audio segment. Therefore, in this work, we propose to apply the transfer learning framework, learning artist-related information which will be used at inference time for genre classification. We consider different types of artist-related information, expressed through artist group factors, which will allow for more efficient learning and stronger robustness to potential label noise. Furthermore, we investigate how to achieve the highest validation accuracy on the given FMA dataset, by experimenting with various kinds of transfer methods, including single-task transfer, multi-task transfer and finally multi-task learning.