Search results | TU Delft Repositories

Searched for: department%3A%22Intelligent%255C%252BSystems%22

(1 - 5 of 5)

document: Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited
Xu, Xing (author), Lin, Kaiyi (author), Yang, Yang (author), Hanjalic, A. (author), Shen, Heng Tao (author)
Recently, generative adversarial network (GAN) has shown its strong ability on modeling data distribution via adversarial learning. Cross-modal GAN, which attempts to utilize the power of GAN to model the cross-modal joint distribution and to learn compatible cross-modal features, is becoming the research hotspot. However, the existing cross...
journal article 2022

document: Unified Binary Generative Adversarial Network for Image Retrieval and Compression
Song, Jingkuan (author), He, Tao (author), Gao, Lianli (author), Xu, Xing (author), Hanjalic, A. (author), Shen, Heng Tao (author)
Binary codes have often been deployed to facilitate large-scale retrieval tasks, but not that often for image compression. In this paper, we propose a unified framework, BGAN+, that restricts the input noise variable of generative adversarial networks to be binary and conditioned on the features of each input image, and simultaneously learns...
journal article 2020

document: Matching images and text with multi-modal tensor fusion and re-ranking
Wang, Tan (author), Hanjalic, A. (author), Xu, Xing (author), Shen, Heng Tao (author), Yang, Yang (author), Song, Jingkuan (author)
A major challenge in matching images and text is that they have intrinsically different data distributions and feature representations. Most existing approaches are based either on embedding or classification, the first one mapping image and text instances into a common embedding space for distance measuring, and the second one regarding...
conference paper 2019

document: From Deterministic to Generative: Multimodal Stochastic RNNs for Video Captioning
Song, Jingkuan (author), Guo, Yuyu (author), Gao, Lianli (author), Li, Xuelong (author), Hanjalic, A. (author), Shen, Heng Tao (author)
Video captioning, in essential, is a complex natural process, which is affected by various uncertainties stemming from video content, subjective judgment, and so on. In this paper, we build on the recent progress in using encoder-decoder framework for video captioning and address what we find to be a critical deficiency of the existing...
journal article 2018

document: Video Captioning by Adversarial LSTM
Yang, Yang (author), Zhou, Jie (author), Ai, Jiangbo (author), Bin, Yi (author), Hanjalic, A. (author), Shen, Heng Tao (author)
In this paper, we propose a novel approach to video captioning based on adversarial learning and long short-term memory (LSTM). With this solution concept, we aim at compensating for the deficiencies of LSTM-based video captioning methods that generally show potential to effectively handle temporal nature of video data when generating...
journal article 2018

Searched for: department%3A%22Intelligent%255C%252BSystems%22

(1 - 5 of 5)