Modeling Feature Representations For Affective Speech Using Generative Adversarial Networks
2019 · Saurabh Sahu, Rahul Gupta, Carol Espy-Wilson
Abstract
Emotion recognition is a classic field of research with a typical setup extracting features and feeding them through a classifier for prediction. On the other hand, generative models jointly capture the distributional relationship between emotions and the feature profiles. Relatively recently, Generative Adversarial Networks (GANs) have surfaced as a new class of generative models and have shown considerable success in modeling distributions in the fields of computer vision and natural language understanding. In this work, we experiment with variants of GAN architectures to generate feature vectors corresponding to an emotion in two ways: (i) A generator is trained with samples from a mixture prior. Each mixture component corresponds to an emotional class and can be sampled to generate features from the corresponding emotion. (ii) A one-hot vector corresponding to an emotion can be explicitly used to generate the features. We perform analysis on such models and also propose different m
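The two conditioning schemes named in the abstract can be illustrated by how the generator's input is built. The following is a minimal sketch under stated assumptions, not the authors' implementation: the emotion label set, the latent and noise dimensions, the Gaussian form of the mixture components, and the placement of the component means on a circle are all hypothetical choices made for illustration. The generator network itself is omitted.

```python
import numpy as np

# Hypothetical label set and sizes; the abstract does not specify them.
EMOTIONS = ["angry", "happy", "neutral", "sad"]


def mixture_component_means(n_classes, radius=4.0):
    """Place one 2-D component mean per emotion on a circle (an assumed layout)."""
    angles = 2 * np.pi * np.arange(n_classes) / n_classes
    return radius * np.stack([np.cos(angles), np.sin(angles)], axis=1)


def sample_mixture_prior(labels, means, rng, std=1.0):
    """Scheme (i): draw each latent vector from the component tied to its emotion."""
    noise = rng.standard_normal((len(labels), means.shape[1]))
    return means[labels] + std * noise


def sample_conditional_input(labels, n_classes, noise_dim, rng):
    """Scheme (ii): concatenate unconditioned noise with a one-hot emotion vector."""
    noise = rng.standard_normal((len(labels), noise_dim))
    one_hot = np.eye(n_classes)[labels]
    return np.concatenate([noise, one_hot], axis=1)


rng = np.random.default_rng(0)
labels = np.array([0, 1, 2, 3, 1])  # indices into EMOTIONS
means = mixture_component_means(len(EMOTIONS))

z_mix = sample_mixture_prior(labels, means, rng)  # shape (5, 2)
z_cond = sample_conditional_input(labels, len(EMOTIONS), noise_dim=8, rng=rng)  # shape (5, 12)
```

In scheme (i) the emotion is encoded implicitly by which mixture component the latent vector is drawn from, while in scheme (ii) it is passed explicitly as extra input dimensions; either array would then be fed to a generator that outputs a feature vector.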
Related papers
- On Enhancing Speech Emotion Recognition Using Generative Adversarial Networks (2018)
- Generative Adversarial Networks In Human Emotion Synthesis: A Review (2020)
- Adversarial Auto-encoders For Speech Based Emotion Recognition (2018)
- Learning Representations Of Emotional Speech With Deep Convolutional Generative Adversarial Networks (2017)
- Augmenting Generative Adversarial Networks For Speech Emotion Recognition (2020)
- Improving Speech Emotion Recognition With Mutual Information Regularized Generative Model (2025)
- Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach (2025)
- Speech2AffectiveGestures: Synthesizing Co-speech Gestures With Generative Adversarial Affective Expression Learning (2021)