Unsupervised Cross-lingual Speech Emotion Recognition Using Domainadversarial Neural Network
2020 Β· Xiong Cai, Zhiyong Wu, Kuo Zhong, et al.
Abstract
By using deep learning approaches, Speech Emotion Recog-nition (SER) on a single domain has achieved many excellentresults. However, cross-domain SER is still a challenging taskdue to the distribution shift between source and target domains.In this work, we propose a Domain Adversarial Neural Net-work (DANN) based approach to mitigate this distribution shiftproblem for cross-lingual SER. Specifically, we add a languageclassifier and gradient reversal layer after the feature extractor toforce the learned representation both language-independent andemotion-meaningful. Our method is unsupervised, i. e., labelson target language are not required, which makes it easier to ap-ply our method to other languages. Experimental results showthe proposed method provides an average absolute improve-ment of 3.91% over the baseline system for arousal and valenceclassification task. Furthermore, we find that batch normaliza-tion is beneficial to the performance gain of DANN. Thereforewe also explore th
Authors
(none)
Tags
Stats
Related papers
- Domain Adversarial Learning For Emotion Recognition (2019)0.00
- Self Supervised Adversarial Domain Adaptation For Cross-corpus And Cross-language Speech Emotion Recognition (2022)13.11
- Unsupervised Adversarial Domain Adaptation For Cross-lingual Speech Emotion Recognition (2019)12.74
- Semi-supervised Cross-lingual Speech Emotion Recognition (2022)10.85
- Adversarial Learning Of Raw Speech Features For Domain Invariant Speech Recognition (2018)9.23
- Layer-adapted Implicit Distribution Alignment Networks For Cross-corpus Speech Emotion Recognition (2023)4.52
- Multilingual Speech Emotion Recognition With Multi-gating Mechanism And Neural Architecture Search (2022)2.26
- End-to-end Transfer Learning For Speaker-independent Cross-language And Cross-corpus Speech Emotion Recognition (2023)0.00