Dynamic Graph Neural ODE Network For Multi-modal Emotion Recognition In Conversation
2024 · Yuntao Shou, Tao Meng, Wei Ai, et al.
Abstract
Multimodal emotion recognition in conversation (MERC) identifies and classifies human emotional states by combining data from multiple modalities (e.g., audio, images, text, and video). Most existing MERC methods rely on graph convolutional networks (GCNs) to improve performance, but these GCN-based methods are prone to overfitting and cannot capture the temporal dependency of a speaker's emotions. To address these problems, we propose a Dynamic Graph Neural Ordinary Differential Equation Network (DGODE) for MERC, which models the dynamic changes of emotions to capture the temporal dependency of speakers' emotions and effectively alleviates the overfitting problem of GCNs. Technically, the key idea of DGODE is to use an adaptive mixhop mechanism to improve the generalization ability of GCNs and a graph ODE evolution network to characterize the continuous dynamics of node representations over time and capture temporal dependencies. Extensive experiments on
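The two ingredients named in the abstract, a mixhop-style combination of several neighborhood hops and a graph ODE that evolves node representations continuously, can be illustrated with a small sketch. This is not the authors' implementation: the ODE form dH/dt = mixhop(H) - H, the fixed (non-learned) hop weights, the explicit Euler solver, and all function names are assumptions chosen for illustration only.

```python
def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so every node keeps part of its own signal.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / ((deg[i] ** 0.5) * (deg[j] ** 0.5))
             for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mixhop(a_norm, h, hop_weights):
    """Weighted sum of hop_weights[k] * (a_norm^k @ h) for k = 0..K.

    Assumption: the weights are fixed here; an adaptive variant would
    learn them during training.
    """
    out = [[hop_weights[0] * v for v in row] for row in h]
    power = h
    for w in hop_weights[1:]:
        power = matmul(a_norm, power)  # one more hop of propagation
        out = [[o + w * p for o, p in zip(row_o, row_p)]
               for row_o, row_p in zip(out, power)]
    return out


def graph_ode_euler(a_norm, h0, hop_weights, t_end=1.0, steps=10):
    """Integrate dH/dt = mixhop(H) - H from t = 0 to t_end with Euler steps."""
    dt = t_end / steps
    h = [row[:] for row in h0]
    for _ in range(steps):
        mixed = mixhop(a_norm, h, hop_weights)
        h = [[hv + dt * (mv - hv) for hv, mv in zip(row_h, row_m)]
             for row_h, row_m in zip(h, mixed)]
    return h


# Hypothetical three-utterance conversation graph (a chain) with 2-d features.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
features = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
evolved = graph_ode_euler(normalize_adjacency(adjacency), features,
                          hop_weights=[0.2, 0.5, 0.3])
```

In this sketch the integration time `t_end` plays the role that layer depth plays in a stacked GCN: running the ODE longer propagates information further without adding parameters. A practical version would presumably replace the Euler loop with an adaptive ODE solver and learn the hop weights.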
Related papers
- GSDNet: Revisiting Incomplete Multimodal-diffusion From Graph Spectrum Perspective For Conversation Emotion Recognition (2025) · 0.00
- GatedxLSTM: A Multimodal Affective Computing Approach For Emotion Recognition In Conversations (2025) · 0.00
- BeMERC: Behavior-aware MLLM-based Framework For Multimodal Emotion Recognition In Conversation (2025) · 0.00
- A Comprehensive Survey On Multi-modal Conversational Emotion Recognition With Deep Learning (2023) · 0.00
- Quality-controlled Multimodal Emotion Recognition In Conversations With Identity-based Transfer Learning And MAMBA Fusion (2025) · 0.00
- Capturing Spectral And Long-term Contextual Information For Speech Emotion Recognition Using Deep Learning Techniques (2023) · 0.00
- Conversational Emotion Analysis Via Attention Mechanisms (2019) · 10.35
- S+PAGE: A Speaker And Position-aware Graph Neural Network Model For Emotion Recognition In Conversation (2021) · 6.77