Exploring The Effectiveness Of Self-supervised Learning And Classifier Chains In Emotion Recognition Of Nonverbal Vocalizations
2022 Β· Detai Xin, Shinnosuke Takamichi, Hiroshi Saruwatari
Abstract
We present an emotion recognition system for nonverbal vocalizations (NVs) submitted to the ExVo Few-Shot track of the ICML Expressive Vocalizations Competition 2022. The proposed method uses self-supervised learning (SSL) models to extract features from NVs and uses a classifier chain to model the label dependency between emotions. Experimental results demonstrate that the proposed method can significantly improve the performance of this task compared to several baseline methods. Our proposed method obtained a mean concordance correlation coefficient (CCC) of \(0.725\) in the validation set and \(0.739\) in the test set, while the best baseline method only obtained \(0.554\) in the validation set. We publicate our code at https://github.com/Aria-K-Alethia/ExVo to help others to reproduce our experimental results.
Authors
(none)
Tags
Stats
Code
Related papers
- Self-relation Attention And Temporal Awareness For Emotion Recognition Via Vocal Burst (2022)4.18
- Comparing Supervised And Self-supervised Embedding For Exvo Multi-task Learning Track (2022)0.00
- Self-supervision And Learnable Strfs For Age, Emotion, And Country Prediction (2022)0.00
- Unsupervised Representations Improve Supervised Learning In Speech Emotion Recognition (2023)0.00
- End-to-end Integration Of Speech Emotion Recognition With Voice Activity Detection Using Self-supervised Learning Features (2024)0.00
- Self-supervised Attention Networks And Uncertainty Loss Weighting For Multi-task Emotion Recognition On Vocal Bursts (2022)0.00
- Exploring Self-supervised Multi-view Contrastive Learning For Speech Emotion Recognition With Limited Annotations (2024)3.58
- X-vectors Meet Emotions: A Study On Dependencies Between Emotion And Speaker Recognition (2020)14.23