Multi-channel Speaker Verification For Single And Multi-talker Speech
2020 Β· Saurabh Kataria, Shi-Xiong Zhang, Dong Yu
Abstract
To improve speaker verification in real scenarios with interference speakers, noise, and reverberation, we propose to bring together advancements made in multi-channel speech features. Specifically, we combine spectral, spatial, and directional features, which includes inter-channel phase difference, multi-channel sinc convolutions, directional power ratio features, and angle features. To maximally leverage supervised learning, our framework is also equipped with multi-channel speech enhancement and voice activity detection. On all simulated, replayed, and real recordings, we observe large and consistent improvements at various degradation levels. On real recordings of multi-talker speech, we achieve a 36% relative reduction in equal error rate w.r.t. single-channel baseline. We find the improvements from speaker-dependent directional features more consistent in multi-talker conditions than clean. Lastly, we investigate if the learned multi-channel speaker embedding space can be made m
Authors
(none)
Tags
Stats
Related papers
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- Target Speaker Extraction For Overlapped Multi-talker Speaker Verification (2019)0.00
- Multi-channel Target Speech Extraction With Channel Decorrelation And Target Speaker Adaptation (2020)0.00
- Speaker Verification In Multi-speaker Environments Using Temporal Feature Fusion (2022)0.00
- Time-domain Speech Extraction With Spatial Information And Multi Speaker Conditioning Mechanism (2021)7.81
- Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild (2020)0.00
- Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays (2023)0.00
- Joint Speaker Features Learning For Audio-visual Multichannel Speech Separation And Recognition (2024)0.00