Task-specific Optimization Of Virtual Channel Linear Prediction-based Speech Dereverberation Front-end For Far-field Speaker Verification
2021 Β· Joon-Young Yang, Joon-Hyuk Chang
Abstract
Developing a single-microphone speech denoising or dereverberation front-end for robust automatic speaker verification (ASV) in noisy far-field speaking scenarios is challenging. To address this problem, we present a novel front-end design that involves a recently proposed extension of the weighted prediction error (WPE) speech dereverberation algorithm, the virtual acoustic channel expansion (VACE)-WPE. It is demonstrated experimentally in this study that unlike the conventional WPE algorithm, the VACE-WPE can be explicitly trained to cancel out both late reverberation and background noise. To build the front-end, the VACE-WPE is first independently (pre)trained to produce "noisy" dereverberated signals. Subsequently, given a pretrained speaker embedding model, the VACE-WPE is additionally fine-tuned within a task-specific optimization (TSO) framework, causing the speaker embedding extracted from the processed signal to be similar to that extracted from the "noise-free" target signal.
Authors
(none)
Tags
Stats
Related papers
- VACE-WPE: Virtual Acoustic Channel Expansion Based On Neural Networks For Weighted Prediction Error-based Speech Dereverberation (2021)3.58
- End-to-end Far-field Speech Recognition With Unified Dereverberation And Beamforming (2020)10.61
- End-to-end Dereverberation, Beamforming, And Speech Recognition With Improved Numerical Stability And Advanced Frontend (2021)10.97
- How To Leverage Dnn-based Speech Enhancement For Multi-channel Speaker Verification? (2022)0.00
- Integrated Speech Enhancement Method Based On Weighted Prediction Error And DNN For Dereverberation And Denoising (2017)0.00
- A Wavenet For Speech Denoising (2017)18.47
- Improved Far-field Speech Recognition Using Joint Variational Autoencoder (2022)0.00
- Neural Network-augmented Kalman Filtering For Robust Online Speech Dereverberation In Noisy Reverberant Environments (2022)0.00