Analysis Of Constant-q Filterbank Based Representations For Speech Emotion Recognition
2022 Β· Premjeet Singh, Shefali Waldekar, Md Sahidullah, et al.
Abstract
This work analyzes the constant-Q filterbank-based time-frequency representations for speech emotion recognition (SER). Constant-Q filterbank provides non-linear spectro-temporal representation with higher frequency resolution at low frequencies. Our investigation reveals how the increased low-frequency resolution benefits SER. The time-domain comparative analysis between short-term mel-frequency spectral coefficients (MFSCs) and constant-Q filterbank-based features, namely constant-Q transform (CQT) and continuous wavelet transform (CWT), reveals that constant-Q representations provide higher time-invariance at low-frequencies. This provides increased robustness against emotion irrelevant temporal variations in pitch, especially for low-arousal emotions. The corresponding frequency-domain analysis over different emotion classes shows better resolution of pitch harmonics in constant-Q-based time-frequency representations than MFSC. These advantages of constant-Q representations are fur
Authors
(none)
Tags
Stats
Related papers
- Non-linear Frequency Warping Using Constant-q Transformation For Speech Emotion Recognition (2021)8.09
- Speech Emotion Recognition Using Quaternion Convolutional Neural Networks (2021)12.74
- Pitch-synchronous Single Frequency Filtering Spectrogram For Speech Emotion Recognition (2019)11.19
- Enhanced Speech Emotion Recognition With Efficient Channel Attention Guided Deep Cnn-bilstm Framework (2024)0.00
- Temporal-frequency State Space Duality: An Efficient Paradigm For Speech Emotion Recognition (2024)7.50
- Leveraged Mel Spectrograms Using Harmonic And Percussive Components In Speech Emotion Recognition (2023)9.03
- Speecheq: Speech Emotion Recognition Based On Multi-scale Unified Datasets And Multitask Learning (2022)5.84
- Sigwavnet: Learning Multiresolution Signal Wavelet Network For Speech Emotion Recognition (2025)8.48