How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification?
2022 Β· Sandipana Dowerah, Romain Serizel, Denis Jouvet, et al.
Abstract
Speaker verification (SV) suffers from unsatisfactory performance in far-field scenarios due to environmental noise andthe adverse impact of room reverberation. This work presents a benchmark of multichannel speech enhancement for far-fieldspeaker verification. One approach is a deep neural network-based, and the other is a combination of deep neural network andsignal processing. We integrated a DNN architecture with signal processing techniques to carry out various experiments. Ourapproach is compared to the existing state-of-the-art approaches. We examine the importance of enrollment in pre-processing,which has been largely overlooked in previous studies. Experimental evaluation shows that pre-processing can improve the SVperformance as long as the enrollment files are processed similarly to the test data and that test and enrollment occur within similarSNR ranges. Considerable improvement is obtained on the generated and all the noise conditions of the VOiCES dataset.
Authors
(none)
Tags
Stats
Related papers
- Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild (2020)0.00
- Parameterized Channel Normalization For Far-field Deep Speaker Verification (2021)3.58
- Multi-channel Speaker Verification For Single And Multi-talker Speech (2020)0.00
- Feature Enhancement With Deep Feature Losses For Speaker Verification (2019)10.61
- Deep Speaker Embeddings For Far-field Speaker Recognition On Short Utterances (2020)11.29
- Analysis Of DNN Speech Signal Enhancement For Robust Speaker Recognition (2018)11.39
- On The Role Of Spatial, Spectral, And Temporal Processing For Dnn-based Non-linear Multi-channel Speech Enhancement (2022)7.81
- NPU Speaker Verification System For INTERSPEECH 2020 Far-field Speaker Verification Challenge (2020)7.50