On Residual CNN In Text-dependent Speaker Verification Task
2017 · Egor Malykh, Sergey Novoselov, Oleg Kudashev
Abstract
Deep learning approaches are still not very common in the speaker verification field. We investigate the possibility of using a deep residual convolutional neural network with spectrograms as input features in the text-dependent speaker verification task. Although we were not able to surpass the baseline system in quality, we achieved quite good results for such a new approach, obtaining a 5.23% EER on the RSR2015 evaluation part. Fusion of the baseline and proposed systems outperformed the best individual system by 18% relative.
Related papers
- Deep Speaker Feature Learning For Text-independent Speaker Verification (2017)
- Frequency And Temporal Convolutional Attention For Text-independent Speaker Recognition (2019)
- Deep CNN Based Feature Extractor For Text-prompted Speaker Recognition (2018)
- Deep Speaker Embedding Learning With Multi-level Pooling For Text-independent Speaker Verification (2019)
- Speakernet: 1D Depth-wise Separable Convolutional Network For Text-independent Speaker Recognition And Verification (2020)
- On Deep Speaker Embeddings For Text-independent Speaker Recognition (2018)
- Residual Convolutional CTC Networks For Automatic Speech Recognition (2017)
- Rsknet-mtsp: Effective And Portable Deep Architecture For Speaker Verification (2021)