Unsupervised Adversarial Domain Adaptation For Cross-lingual Speech Emotion Recognition
2019 · Siddique Latif, Junaid Qadir, Muhammad Bilal
Abstract
Cross-lingual speech emotion recognition (SER) is a crucial task for many real-world applications. The performance of SER systems is often degraded by differences between the distributions of training and test data. These differences become more apparent when training and test data belong to different languages, which causes a significant performance gap between the validation and test scores. It is imperative to build more robust models that can fit in practical applications of SER systems. Therefore, in this paper, we propose a Generative Adversarial Network (GAN)-based model for multilingual SER. Our choice of GANs is motivated by their great success in learning the underlying data distribution. The proposed model is designed so that it can learn language-invariant representations without requiring target-language data labels. We evaluate our proposed model on emotional datasets in four different languages, including an Urdu-language dataset to also incorporate alternative la
Related papers
- Self Supervised Adversarial Domain Adaptation For Cross-corpus And Cross-language Speech Emotion Recognition (2022) 13.11
- Unsupervised Cross-lingual Speech Emotion Recognition Using Domain Adversarial Neural Network (2020) 0.00
- Augmenting Generative Adversarial Networks For Speech Emotion Recognition (2020) 10.85
- Semi-supervised Cross-lingual Speech Emotion Recognition (2022) 10.85
- Multilingual Speech Emotion Recognition With Multi-gating Mechanism And Neural Architecture Search (2022) 2.26
- Generative Data Augmentation Guided By Triplet Loss For Speech Emotion Recognition (2022) 3.58
- Towards Adversarial Learning Of Speaker-invariant Representation For Speech Emotion Recognition (2019) 0.00
- On Enhancing Speech Emotion Recognition Using Generative Adversarial Networks (2018) 12.33