The Exploitation Of Multiple Feature Extraction Techniques For Speaker Identification In Emotional States Under Disguised Voices
2021 Β· Noor Ahmad Al Hindawi, Ismail Shahin, Ali Bou Nassif
Abstract
Due to improvements in artificial intelligence, speaker identification (SI) technologies have brought a great direction and are now widely used in a variety of sectors. One of the most important components of SI is feature extraction, which has a substantial impact on the SI process and performance. As a result, numerous feature extraction strategies are thoroughly investigated, contrasted, and analyzed. This article exploits five distinct feature extraction methods for speaker identification in disguised voices under emotional environments. To evaluate this work significantly, three effects are used: high-pitched, low-pitched, and Electronic Voice Conversion (EVC). Experimental results reported that the concatenated Mel-Frequency Cepstral Coefficients (MFCCs), MFCCs-delta, and MFCCs-delta-delta is the best feature extraction method.
Authors
(none)
Tags
Stats
Related papers
- A Comparative Re-assessment Of Feature Extractors For Deep Speaker Embeddings (2020)8.09
- Vocal Style Factorization For Effective Speaker Recognition In Affective Scenarios (2023)0.00
- Feature Selection Enhancement And Feature Space Visualization For Speech-based Emotion Recognition (2022)7.50
- Identifying Speakers Using Their Emotion Cues (2018)10.85
- New Insights On Target Speaker Extraction (2022)0.00
- Deep Learning For Speaker Identification: Architectural Insights From AB-1 Corpus Analysis And Performance Evaluation (2024)0.00
- Emotion Invariant Speaker Embeddings For Speaker Identification With Emotional Speech (2020)0.00
- Is Style All You Need? Dependencies Between Emotion And Gst-based Speaker Recognition (2022)0.00