Augmenting Generative Adversarial Networks For Speech Emotion Recognition
2020 Β· Siddique Latif, Muhammad Asim, Rajib Rana, et al.
Abstract
Generative adversarial networks (GANs) have shown potential in learning emotional attributes and generating new data samples. However, their performance is usually hindered by the unavailability of larger speech emotion recognition (SER) data. In this work, we propose a framework that utilises the mixup data augmentation scheme to augment the GAN in feature learning and generation. To show the effectiveness of the proposed framework, we present results for SER on (i) synthetic feature vectors, (ii) augmentation of the training data with synthetic features, (iii) encoded features in compressed representation. Our results show that the proposed framework can effectively learn compressed emotional representations as well as it can generate synthetic samples that help improve performance in within-corpus and cross-corpus evaluation.
Authors
(none)
Tags
Stats
Related papers
- Generative Data Augmentation Guided By Triplet Loss For Speech Emotion Recognition (2022)3.58
- On Enhancing Speech Emotion Recognition Using Generative Adversarial Networks (2018)12.33
- A Preliminary Study On Augmenting Speech Emotion Recognition Using A Diffusion Model (2023)0.00
- Hybrid Data Augmentation And Deep Attention-based Dilated Convolutional-recurrent Neural Networks For Speech Emotion Recognition (2021)12.81
- Generative Emotional AI For Speech Emotion Recognition: The Case For Synthetic Emotional Speech Augmentation (2023)11.19
- Improving Speech Emotion Recognition With Mutual Information Regularized Generative Model (2025)0.00
- Unsupervised Adversarial Domain Adaptation For Cross-lingual Speech Emotion Recognition (2019)12.74
- Adversarial Machine Learning And Speech Emotion Recognition: Utilizing Generative Adversarial Networks For Robustness (2018)0.00