
Yu Gu

6 records found

Speech signals contain rich information, such as textual content, emotion, and speaker identity. To extract these features more efficiently, researchers are investigating joint training across multiple tasks, like Speech Emotion Recognition (SER) and Speaker Verification (SV), ai ...

Scene-Speaker Emotion Aware Network

Dual Network Strategy for Conversational Emotion Recognition

Incorporating external knowledge has been shown to improve emotion understanding in dialogues by enriching contextual information, such as character motivations, psychological states, and causal relations between events. Filtering and categorizing this information can significant ...

TDMER

A Task-Driven Method for Multimodal Emotion Recognition

In multimodal emotion recognition, disentangled representation learning methods effectively address the inherent heterogeneity among modalities. To facilitate the flexible integration of enhanced disentangled features into multimodal emotional features, we propose a task-driven mu ...
Multimodal emotion recognition (MER) is essential for understanding human emotions from diverse sources such as speech, text, and video. However, modality heterogeneity and inconsistent expression pose challenges for effective feature fusion. To address this, we propose a novel M ...
Speech emotion recognition (SER) poses one of the major challenges in human-machine interaction. We propose a new algorithm, the Voiced Segment Selection (VSS) algorithm, which can produce an accurate segmentation of speech signals. The VSS algorithm deals with the voiced signal ...
Speech emotion recognition has been a prevalent research topic in recent years. Existing speech emotion recognition approaches mainly involve processing and analyzing speech signals, in order to discern the speaker’s emotions in speech. 2D Gabor filters have been used to extract ...
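The 2D Gabor filtering mentioned in the last abstract is a standard feature extractor for spectrograms: a Gaussian envelope modulated by a sinusoidal carrier at a chosen orientation and wavelength. A minimal NumPy sketch of building such a kernel is below; the function name and all parameter values (`size`, `sigma`, `theta`, `lambd`, `gamma`, `psi`) are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

def gabor_kernel_2d(size=21, sigma=4.0, theta=0.0, lambd=10.0, gamma=0.5, psi=0.0):
    """Real-valued 2D Gabor kernel (parameters are illustrative assumptions).

    size  : kernel side length in pixels (odd, so the kernel is centered)
    sigma : std. dev. of the Gaussian envelope
    theta : filter orientation in radians
    lambd : wavelength of the sinusoidal carrier
    gamma : spatial aspect ratio of the envelope
    psi   : phase offset of the carrier
    """
    half = size // 2
    # Coordinate grids centered on the kernel middle.
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    # Rotate coordinates to the filter orientation.
    x_t = x * np.cos(theta) + y * np.sin(theta)
    y_t = -x * np.sin(theta) + y * np.cos(theta)
    # Gaussian envelope times cosine carrier.
    envelope = np.exp(-(x_t**2 + (gamma * y_t)**2) / (2 * sigma**2))
    carrier = np.cos(2 * np.pi * x_t / lambd + psi)
    return envelope * carrier

kernel = gabor_kernel_2d()
```

In a typical pipeline the kernel would be convolved with a log-mel spectrogram (e.g. via `scipy.signal.fftconvolve`) at several orientations and wavelengths, and the filter responses pooled into a feature vector for the emotion classifier.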