Pitch-synchronous Single Frequency Filtering Spectrogram For Speech Emotion Recognition
2019 · Shruti Gupta, Md. Shah Fahad, Akshay Deepak
Abstract
Convolutional neural networks (CNNs) are widely used for speech emotion recognition (SER). In such systems, the short-time Fourier transform (STFT) spectrogram is the most popular representation of speech fed as input to the CNN. However, the uncertainty principle underlying the STFT prevents it from achieving high time resolution and high frequency resolution simultaneously. The recently proposed single frequency filtering (SFF) spectrogram promises to be a better alternative because it provides good time and frequency resolution at the same time. In this work, we explore the SFF spectrogram as an alternative representation of speech for SER. We modify the SFF spectrogram by averaging the amplitudes of all samples between two successive glottal closure instant (GCI) locations. The duration between two successive GCI locations gives the pitch period, which motivates the name pitch-synchronous SFF spectrogram for the modified representation. The
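The averaging step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `pitch_synchronous_average`, the `(num_frequencies, num_samples)` layout of the SFF amplitude envelopes, and the half-open segment convention `[gci[i], gci[i+1])` are all hypothetical choices made here. The SFF envelopes and the GCI locations are assumed to have been computed beforehand by some other method.

```python
import numpy as np


def pitch_synchronous_average(envelopes, gci):
    """Average amplitude envelopes between successive GCI locations.

    envelopes: array of shape (num_frequencies, num_samples), assumed to hold
               SFF amplitude envelopes (one row per analysis frequency).
    gci:       strictly increasing sample indices of glottal closure instants.

    Returns an array of shape (num_frequencies, len(gci) - 1) in which column i
    is the mean of the samples in the half-open segment [gci[i], gci[i+1]).
    The half-open convention is an assumption of this sketch.
    """
    envelopes = np.asarray(envelopes, dtype=float)
    gci = np.asarray(gci, dtype=int)

    if envelopes.ndim != 2:
        raise ValueError("envelopes must be 2-D (frequencies x samples)")
    if gci.ndim != 1 or gci.size < 2:
        raise ValueError("need at least two GCI locations")
    if np.any(np.diff(gci) <= 0):
        raise ValueError("GCI locations must be strictly increasing")
    if gci[0] < 0 or gci[-1] > envelopes.shape[1]:
        raise ValueError("GCI locations fall outside the signal")

    # Prefix sums with a leading zero column, so that the sum over
    # [a, b) equals csum[:, b] - csum[:, a].
    zeros = np.zeros((envelopes.shape[0], 1))
    csum = np.concatenate([zeros, np.cumsum(envelopes, axis=1)], axis=1)

    segment_sums = csum[:, gci[1:]] - csum[:, gci[:-1]]
    segment_lengths = np.diff(gci)  # samples per pitch period
    return segment_sums / segment_lengths


# Toy example: 2 frequency channels, 6 samples, GCIs at samples 0, 2 and 6.
toy_envelopes = np.array([[1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
                          [0.0, 0.0, 2.0, 2.0, 4.0, 4.0]])
toy_gci = [0, 2, 6]
toy_result = pitch_synchronous_average(toy_envelopes, toy_gci)
```

Each output column corresponds to one pitch period, so the time axis of the result is no longer uniform: the number of columns depends on how many GCIs are detected, not on a fixed frame shift.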
Related papers
- Speech Emotion Recognition Via An Attentive Time-frequency Neural Network (2022)
- Leveraged Mel Spectrograms Using Harmonic And Percussive Components In Speech Emotion Recognition (2023)
- Enhanced Speech Emotion Recognition With Efficient Channel Attention Guided Deep CNN-BiLSTM Framework (2024)
- Improved Speech Emotion Recognition Using Transfer Learning And Spectrogram Augmentation (2021)
- Real-time Speech Emotion Recognition Based On Syllable-level Feature Extraction (2022)
- Non-linear Frequency Warping Using Constant-Q Transformation For Speech Emotion Recognition (2021)
- Learning Spectro-temporal Features With 3D CNNs For Speech Emotion Recognition (2017)
- Searching For Effective Preprocessing Method And CNN-based Architecture With Efficient Channel Attention On Speech Emotion Recognition (2024)