Guest Editorial Learning From Noisy Multimedia Data

Review (2022)
Author(s)

Jian Zhang (University of Technology Sydney)

A Hanjalic (TU Delft - Intelligent Systems)

Ramesh Jain (University of California)

Xiansheng Hua (Alibaba Damo Academy)

Shin'ichi Satoh (National Institute of Informatics)

Yazhou Yao (Nanjing University of Science and Technology)

Dan Zeng (Shanghai University)

Department
Intelligent Systems
Copyright
© 2022 Jian Zhang, A. Hanjalic, Ramesh Jain, Xiansheng Hua, Shin'ichi Satoh, Yazhou Yao, Dan Zeng
DOI related publication
https://doi.org/10.1109/TMM.2022.3159014
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Jian Zhang, A. Hanjalic, Ramesh Jain, Xiansheng Hua, Shin'ichi Satoh, Yazhou Yao, Dan Zeng
Department
Intelligent Systems
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Volume number
24
Pages (from-to)
1247-1252
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This special issue provides a premier forum for researchers in multimedia big data to share challenges and recent advancements in learning from noisy multimedia data. The multimedia age and its proliferation of devices and platforms is fueling exponential data growth. As computational power and deep learning algorithms rapidly evolve, the web has become a rich source of potential training data for robust machine learning, with search engines such as Google and Bing, Twitter, TikTok, Instagram, and short video sharing platforms offering large-scale data points in the hundreds of millions. The concurrent shift in the Internet to richer web data modalities such as text, audio, image, and video reveal further opportunities to leverage large-scale data for the automatic construction of a variety of datasets for model training and testing. However, the ubiquity of multimedia data means noise is a fundamental challenge, with a label noisea and a domain mismatcha the most critical issues in automatically collected datasets. Learning from noisy multimedia data tends towards poor performance, making it increasingly essential to address these challenges.

Files

Guest_Editorial_Learning_From_... (pdf)
(pdf | 0.898 Mb)
- Embargo expired in 01-07-2023
License info not available