Linguistic And Gender Variation In Speech Emotion Recognition Using Spectral Features
2021 Β· Zachary Dair, Ryan Donovan, Ruairi O'Reilly
Abstract
This work explores the effect of gender and linguistic-based vocal variations on the accuracy of emotive expression classification. Emotive expressions are considered from the perspective of spectral features in speech (Mel-frequency Cepstral Coefficient, Melspectrogram, Spectral Contrast). Emotions are considered from the perspective of Basic Emotion Theory. A convolutional neural network is utilised to classify emotive expressions in emotive audio datasets in English, German, and Italian. Vocal variations for spectral features assessed by (i) a comparative analysis identifying suitable spectral features, (ii) the classification performance for mono, multi and cross-lingual emotive data and (iii) an empirical evaluation of a machine learning model to assess the effects of gender and linguistic variation on classification accuracy. The results showed that spectral features provide a potential avenue for increasing emotive expression classification. Additionally, the accuracy of emotive
Authors
(none)
Tags
Stats
Related papers
- Emotion Recognition From Speech (2019)0.00
- Leveraged Mel Spectrograms Using Harmonic And Percussive Components In Speech Emotion Recognition (2023)9.03
- A Study Of Language And Classifier-independent Feature Analysis For Vocal Emotion Recognition (2018)0.00
- Objective Human Affective Vocal Expression Detection And Automatic Classification With Stochastic Models And Learning Systems (2019)0.00
- SEGAA: A Unified Approach To Predicting Age, Gender, And Emotion In Speech (2024)0.00
- Multimodal Speech Emotion Recognition And Ambiguity Resolution (2019)0.00
- Emotech: A Multi-modal Speech Emotion Recognition Using Multi-source Low-level Information With Hybrid Recurrent Network (2025)8.35
- Semantic Matters: Multimodal Features For Affective Analysis (2025)0.00