Simple Tag-based Subclass Representations for Visually-varied Image Classes

Conference Paper (2016)
Author(s)

X. Li (TU Delft - Multimedia Computing)

P. Xu (TU Delft - Pattern Recognition and Bioinformatics)

Y. Shi (Yahoo! Labs)

M.A. Larson (TU Delft - Multimedia Computing, Radboud Universiteit Nijmegen)

A. Hanjalic (TU Delft - Multimedia Computing)

Research Group
Multimedia Computing
DOI related publication
https://doi.org/10.1109/cbmi.2016.7500265
More Info
expand_more
Publication Year
2016
Language
English
Research Group
Multimedia Computing
Pages (from-to)
1-6
ISBN (electronic)
978-1-4673-8695-1

Abstract

In this paper, we present a subclass-representation approach that predicts the probability of a social image belonging to one particular class. We explore the co-occurrence of user-contributed tags to find subclasses with a strong connection to the top level class. We then project each image onto the resulting subclass space, generating a subclass representation for the image. The advantage of our tag-based subclasses is that they have a chance of being more visually stable and easier to model than top-level classes. Our contribution is to demonstrate that a simple and inexpensive method for generating sub-class representations has the ability to improve classification results in the case of tag classes that are visually highly heterogenous. The approach is evaluated on a set of 1 million photos with 10 top-level classes, from the dataset released by the ACM Multimedia 2013 Yahoo! Large-scale Flickr-tag Image Classification Grand Challenge. Experiments show that the proposed system delivers sound performance for visually diverse classes compared with methods that directly model top classes.

No files available

Metadata only record. There are no files for this record.