Novel Hybrid DNN Approaches For Speaker Verification In Emotional And Stressful Talking Environments
2021 Β· Ismail Shahin, Ali Bou Nassif, Nawel Nemmour, et al.
Abstract
In this work, we conducted an empirical comparative study of the performance of text-independent speaker verification in emotional and stressful environments. This work combined deep models with shallow architecture, which resulted in novel hybrid classifiers. Four distinct hybrid models were utilized: deep neural network-hidden Markov model (DNN-HMM), deep neural network-Gaussian mixture model (DNN-GMM), Gaussian mixture model-deep neural network (GMM-DNN), and hidden Markov model-deep neural network (HMM-DNN). All models were based on novel implemented architecture. The comparative study used three distinct speech datasets: a private Arabic dataset and two public English databases, namely, Speech Under Simulated and Actual Stress (SUSAS) and Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). The test results of the aforementioned hybrid models demonstrated that the proposed HMM-DNN leveraged the verification performance in emotional and stressful environments. Resu
Authors
(none)
Tags
Stats
Related papers
- Novel Cascaded Gaussian Mixture Model-deep Neural Network Classifier For Speaker Identification In Emotional Talking Environments (2018)12.74
- Three-stage Speaker Verification Architecture In Emotional Talking Environments (2018)7.16
- Speaker Verification In Emotional Talking Environments Based On Three-stage Framework (2018)6.34
- Multi-modal Hybrid Deep Neural Network For Speech Enhancement (2016)0.00
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- DNN Based Speaker Recognition On Short Utterances (2016)0.00
- Accounting For Variations In Speech Emotion Recognition With Nonparametric Hierarchical Neural Network (2021)0.00
- Self-adaptive Soft Voice Activity Detection Using Deep Neural Networks For Robust Speaker Verification (2019)6.77