Transfer Learning of Artist Group Factors to Musical Genre Classification

Conference Paper (2018)
Author(s)

Jeahun Kim (TU Delft - Multimedia Computing)

Minz Won (Pompeu Fabra University)

Xavier Serra (Pompeu Fabra University)

Cynthia Liem (TU Delft - Multimedia Computing)

Multimedia Computing
Copyright
© 2018 Jaehun Kim, Minz Won, Xavier Serra, C.C.S. Liem
DOI related publication
https://doi.org/10.1145/3184558.3191823
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 Jaehun Kim, Minz Won, Xavier Serra, C.C.S. Liem
Multimedia Computing
Pages (from-to)
1929-1934
ISBN (print)
978-1-4503-5640-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The automated recognition of music genres from audio information is a challenging problem, as genre labels are subjective and noisy. Artist labels are less subjective and less noisy, while certain artists may relate more strongly to certain genres. At the same time, at prediction time, it is not guaranteed that artist labels are available for a given audio segment. Therefore, in this work, we propose to apply the transfer learning framework, learning artist-related information which will be used at inference time for genre classification. We consider different types of artist-related information, expressed through artist group factors, which will allow for more efficient learning and stronger robustness to potential label noise. Furthermore, we investigate how to achieve the highest validation accuracy on the given FMA dataset, by experimenting with various kinds of transfer methods, including single-task transfer, multi-task transfer and finally multi-task learning.