Neural Network Based Speaker Classification And Verification Systems With Enhanced Features
2017 Β· Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, et al.
Abstract
This work presents a novel framework based on feed-forward neural network for text-independent speaker classification and verification, two related systems of speaker recognition. With optimized features and model training, it achieves 100% classification rate in classification and less than 6% Equal Error Rate (ERR), using merely about 1 second and 5 seconds of data respectively. Features with stricter Voice Active Detection (VAD) than the regular one for speech recognition ensure extracting stronger voiced portion for speaker recognition, speaker-level mean and variance normalization helps to eliminate the discrepancy between samples from the same speaker. Both are proven to improve the system performance. In building the neural network speaker classifier, the network structure parameters are optimized with grid search and dynamically reduced regularization parameters are used to avoid training terminated in local minimum. It enables the training goes further with lower cost. In spea
Authors
(none)
Tags
Stats
Related papers
- DNN Based Speaker Recognition On Short Utterances (2016)0.00
- Self-adaptive Soft Voice Activity Detection Using Deep Neural Networks For Robust Speaker Verification (2019)6.77
- Joint Speaker Encoder And Neural Back-end Model For Fully End-to-end Automatic Speaker Verification With Multiple Enrollment Utterances (2022)0.00
- Speakernet: 1D Depth-wise Separable Convolutional Network For Text-independent Speaker Recognition And Verification (2020)0.00
- Feature Enhancement With Deep Feature Losses For Speaker Verification (2019)10.61
- Speaker Verification Using Convolutional Neural Networks (2018)0.00
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- Deep Speaker Feature Learning For Text-independent Speaker Verification (2017)12.54