A Discriminative Condition-aware Backend For Speaker Verification
2019 · Luciana Ferrer, Mitchell McLaren
Abstract
We present a scoring approach for speaker verification that mimics the standard PLDA-based backend process used in most current speaker verification systems. However, unlike the standard backends, all parameters of the model are jointly trained to optimize the binary cross-entropy for the speaker verification task. We further integrate the calibration stage inside the model, making the parameters of this stage depend on metadata vectors that represent the conditions of the signals. We show that the proposed backend has excellent out-of-the-box calibration performance on most of our test sets, making it an ideal approach for cases in which the test conditions are not known and development data is not available for training a domain-specific calibration model.
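The abstract does not give the functional form of the condition-dependent calibration, so the following is only a minimal sketch of the general idea, not the authors' model. It assumes an affine calibration whose scale and offset are linear, symmetric functions of the enrollment-side and test-side metadata vectors, and it treats the calibrated score as a logit for the binary cross-entropy. The names `calibration_params`, `calibrated_llr`, `w_scale`, `w_offset`, `base_scale` and `base_offset` are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def calibration_params(m_enroll, m_test, w_scale, w_offset, base_scale, base_offset):
    """Hypothetical condition-dependent calibration parameters.

    Assumption: scale and offset are linear in the metadata vectors and
    symmetric in the two sides, so swapping enrollment and test signals
    leaves the result unchanged.
    """
    scale = base_scale + dot(w_scale, m_enroll) + dot(w_scale, m_test)
    offset = base_offset + dot(w_offset, m_enroll) + dot(w_offset, m_test)
    return scale, offset


def calibrated_llr(raw_score, m_enroll, m_test, params):
    """Apply the affine calibration to a raw backend score."""
    scale, offset = calibration_params(m_enroll, m_test, **params)
    return scale * raw_score + offset


def binary_cross_entropy(llrs, labels):
    """Mean binary cross-entropy, treating each calibrated score as a logit.

    Labels are 1 for same-speaker trials and 0 for different-speaker trials.
    Uses a numerically stable form of log(1 + exp(-z)).
    """
    total = 0.0
    for llr, label in zip(llrs, labels):
        z = llr if label == 1 else -llr
        total += max(-z, 0.0) + math.log1p(math.exp(-abs(z)))
    return total / len(llrs)


# Illustrative values only; in a trained backend these would be learned
# jointly with the rest of the model by minimizing the cross-entropy.
params = {
    "w_scale": [0.5, -0.5],
    "w_offset": [1.0, 2.0],
    "base_scale": 1.0,
    "base_offset": 0.0,
}
llr = calibrated_llr(2.0, [1.0, 0.0], [0.0, 1.0], params)
loss = binary_cross_entropy([llr], [1])
```

In this sketch, setting `w_scale` and `w_offset` to zero recovers a single global affine calibration, which is the condition-independent case the abstract contrasts against.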
Related papers
- A Speaker Verification Backend For Improved Calibration Performance Across Varying Conditions (2020)
- Multiobjective Optimization Training Of PLDA For Speaker Verification (2018)
- Attention Back-end For Automatic Speaker Verification With Multiple Enrollment Utterances (2021)
- Analyzing Speaker Verification Embedding Extractors And Back-ends Under Language And Channel Mismatch (2022)
- Squeezing Value Of Cross-domain Labels: A Decoupled Scoring Approach For Speaker Verification (2020)
- Local Training For PLDA In Speaker Verification (2016)
- Joint Speaker Encoder And Neural Back-end Model For Fully End-to-end Automatic Speaker Verification With Multiple Enrollment Utterances (2022)
- Generalized Domain Adaptation Framework For Parametric Back-end In Speaker Recognition (2023)