Optimizing Speech Emotion Recognition Using Manta-ray Based Feature Selection
2020 Β· Soham Chattopadhyay, Arijit Dey, Hritam Basak
Abstract
Emotion recognition from audio signals has been regarded as a challenging task in signal processing as it can be considered as a collection of static and dynamic classification tasks. Recognition of emotions from speech data has been heavily relied upon end-to-end feature extraction and classification using machine learning models, though the absence of feature selection and optimization have restrained the performance of these methods. Recent studies have shown that Mel Frequency Cepstral Coefficients (MFCC) have been emerged as one of the most relied feature extraction methods, though it circumscribes the accuracy of classification with a very small feature dimension. In this paper, we propose that the concatenation of features, extracted by using different existing feature extraction methods can not only boost the classification accuracy but also expands the possibility of efficient feature selection. We have used Linear Predictive Coding (LPC) apart from the MFCC feature extraction
Authors
(none)
Tags
Stats
Related papers
- Feature Selection Enhancement And Feature Space Visualization For Speech-based Emotion Recognition (2022)7.50
- Evaluating Raw Waveforms With Deep Learning Frameworks For Speech Emotion Recognition (2023)0.00
- Emotion Recognition From Speech (2019)0.00
- Multimodal Speech Emotion Recognition And Ambiguity Resolution (2019)0.00
- Temporal-frequency State Space Duality: An Efficient Paradigm For Speech Emotion Recognition (2024)7.50
- Multi-channel Auto-encoder For Speech Emotion Recognition (2018)0.00
- Real-time Speech Emotion Recognition Based On Syllable-level Feature Extraction (2022)8.09
- Fusion Approaches For Emotion Recognition From Speech Using Acoustic And Text-based Features (2024)12.25