Guest Editorial Learning From Noisy Multimedia Data

None, None; None, None; None, None; None, None; None, None; None, None; None, None

Guest Editorial Learning From Noisy Multimedia Data

Review (2022)

Author(s)

Jian Zhang (University of Technology Sydney)

Alan Hanjalic (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Ramesh Jain (University of California)

Xiansheng Hua (Alibaba Damo Academy)

Shin'ichi Satoh (National Institute of Informatics)

Yazhou Yao (Nanjing University of Science and Technology)

Dan Zeng (Shanghai University)

Department

Intelligent Systems

DOI related publication

https://doi.org/10.1109/TMM.2022.3159014 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:d801dcfd-1b30-4823-89f1-37218a9ec4c3

More Info

expand_more

Publication Year

2022

Language

English

Department

Intelligent Systems

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Journal title

IEEE Transactions on Multimedia

Volume number

24

Pages (from-to)

1247-1252

Downloads counter

263

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This special issue provides a premier forum for researchers in multimedia big data to share challenges and recent advancements in learning from noisy multimedia data. The multimedia age and its proliferation of devices and platforms is fueling exponential data growth. As computational power and deep learning algorithms rapidly evolve, the web has become a rich source of potential training data for robust machine learning, with search engines such as Google and Bing, Twitter, TikTok, Instagram, and short video sharing platforms offering large-scale data points in the hundreds of millions. The concurrent shift in the Internet to richer web data modalities such as text, audio, image, and video reveal further opportunities to leverage large-scale data for the automatic construction of a variety of datasets for model training and testing. However, the ubiquity of multimedia data means noise is a fundamental challenge, with a label noisea and a domain mismatcha the most critical issues in automatically collected datasets. Learning from noisy multimedia data tends towards poor performance, making it increasingly essential to address these challenges.

Files

Guest_Editorial_Learning_From_... (pdf)

(pdf | 0.898 Mb)

- Embargo expired in 01-07-2023

License info not available