Classification Of Vocal Bursts For ACII 2022 A-vb-type Competition Using Convolutional Neural Networks And Deep Acoustic Embeddings
2022 Β· Muhammad Shehram Shah Syed, Zafi Sherhan Syed, Abbas Syed
Abstract
This report provides a brief description of our proposed solution for the Vocal Burst Type classification task of the ACII 2022 Affective Vocal Bursts (A-VB) Competition. We experimented with two approaches as part of our solution for the task at hand. The first of which is based on convolutional neural networks trained on Mel Spectrograms, and the second is based on average pooling of deep acoustic embeddings from a pretrained wav2vec2 model. Our best performing model achieves an unweighted average recall (UAR) of 0.5190 for the test partition, compared to the chance-level UAR of 0.1250 and a baseline of 0.4172. Thus, an improvement of around 20% over the challenge baseline. The results reported in this document demonstrate the efficacy of our proposed approaches to solve the AV-B Type Classification task.
Authors
(none)
Tags
Stats
Related papers
- Self-relation Attention And Temporal Awareness For Emotion Recognition Via Vocal Burst (2022)4.18
- Self-supervised Attention Networks And Uncertainty Loss Weighting For Multi-task Emotion Recognition On Vocal Bursts (2022)0.00
- An Efficient Multitask Learning Architecture For Affective Vocal Burst Analysis (2022)0.00
- Multitask Vocal Burst Modeling With Resnets And Pre-trained Paralinguistic Conformers (2022)0.00
- Burst2vec: An Adversarial Multi-task Approach For Predicting Emotion, Age, And Origin From Vocal Bursts (2022)0.00
- Deep Embeddings For Robust User-based Amateur Vocal Percussion Classification (2022)0.00
- Multimodal Fusion Method With Spatiotemporal Sequences And Relationship Learning For Valence-arousal Estimation (2024)0.00
- Convolutional Neural Networks And X-vector Embedding For DCASE2018 Acoustic Scene Classification Challenge (2018)0.00