Y

Yang

9 records found

Joint Feature Synthesis and Embedding

Adversarial Cross-Modal Retrieval Revisited

Recently, generative adversarial network (GAN) has shown its strong ability on modeling data distribution via adversarial learning. Cross-modal GAN, which attempts to utilize the power of GAN to model the cross-modal joint distribution and to learn compatible cross-modal features ...
To probe into the mechanical behaviour of railway transition zone from the macro-meso aspects, a numerical model of transition zone is built that hybrids the Discrete Element Method (DEM) and Finite Difference Method (FDM). The DEM is utilised to simulate the ballast bed and slee ...
In this article, we address the problem of visual question generation (VQG), a challenge in which a computer is required to generate meaningful questions about an image targeting a given answer. The existing approaches typically treat the VQG task as a reversed visual question an ...
We systematically study the indirect interaction between a magnon mode and a cavity photon mode mediated by traveling photons of a waveguide. From a general Hamiltonian, we derive the effective coupling strength between two separated modes, and obtain the theoretical expression o ...
A major challenge in matching images and text is that they have intrinsically different data distributions and feature representations. Most existing approaches are based either on embedding or classification, the first one mapping image and text instances into a common embedding ...
Strong noise is one of the toughest problems in the controlled-source electromagnetic (CSEM) method, which highly affects the quality of recorded data. The three main types of noise existing in CSEM data are periodic noise, Gaussian white noise, and nonperiodic noise, among which ...
In this paper, we propose a novel approach to video captioning based on adversarial learning and long short-term memory (LSTM). With this solution concept, we aim at compensating for the deficiencies of LSTM-based video captioning methods that generally show potential to effectiv ...
Cross-modal retrieval aims to enable flexible retrieval experience across different modalities (e.g., texts vs. images). The core of crossmodal retrieval research is to learn a common subspace where the items of different modalities can be directly compared to each other. In this ...
Modern active distribution networks make use of intelligent switching actions to restore supply to end users after faults. This complicates the reliability analysis of such networks, as the number of possible switching actions grows exponentially with network size. This paper pro ...