Speech Emotion Recognition Via Contrastive Loss Under Siamese Networks
2019 Β· Zheng Lian, Ya Li, Jianhua Tao, et al.
Abstract
Speech emotion recognition is an important aspect of human-computer interaction. Prior work proposes various end-to-end models to improve the classification performance. However, most of them rely on the cross-entropy loss together with softmax as the supervision component, which does not explicitly encourage discriminative learning of features. In this paper, we introduce the contrastive loss function to encourage intra-class compactness and inter-class separability between learnable features. Furthermore, multiple feature selection methods and pairwise sample selection methods are evaluated. To verify the performance of the proposed system, we conduct experiments on The Interactive Emotional Dyadic Motion Capture (IEMOCAP) database, a common evaluation corpus. Experimental results reveal the advantages of the proposed method, which reaches 62.19% in the weighted accuracy and 63.21% in the unweighted accuracy. It outperforms the baseline system that is optimized without the contrastiv
Authors
(none)
Tags
Stats
Related papers
- Supervised Contrastive Learning With Nearest Neighbor Search For Speech Emotion Recognition (2023)7.16
- A Cross-corpus Speech Emotion Recognition Method Based On Supervised Contrastive Learning (2024)0.00
- Attention Based Fully Convolutional Network For Speech Emotion Recognition (2018)15.25
- Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study On The Impact Of Input Features, Signal Length, And Acted Speech (2017)16.14
- Contrastive Regularization For Multimodal Emotion Recognition Using Audio And Text (2022)0.00
- Learning Discriminative Features From Spectrograms Using Center Loss For Speech Emotion Recognition (2025)12.10
- "I Have Vxxx Bxx Connexxxn!": Facing Packet Loss In Deep Speech Emotion Recognition (2020)0.00
- Learning Discriminative Features Using Center Loss And Reconstruction As Regularizer For Speech Emotion Recognition (2019)0.00