Incorporation Of Speech Duration Information In Score Fusion Of Speaker Recognition Systems
2016 Β· Ali Khodabakhsh, Seyyed Saeed Sarfjoo, Umut Uludag, et al.
Abstract
In recent years identity-vector (i-vector) based speaker verification (SV) systems have become very successful. Nevertheless, environmental noise and speech duration variability still have a significant effect on degrading the performance of these systems. In many real-life applications, duration of recordings are very short; as a result, extracted i-vectors cannot reliably represent the attributes of the speaker. Here, we investigate the effect of speech duration on the performance of three state-of-the-art speaker recognition systems. In addition, using a variety of available score fusion methods, we investigate the effect of score fusion for those speaker verification techniques to benefit from the performance difference of different methods under different enrollment and test speech duration conditions. This technique performed significantly better than the baseline score fusion methods.
Authors
(none)
Tags
Stats
Related papers
- Quality Measures For Speaker Verification With Short Utterances (2019)0.00
- Speaker Verification In Multi-speaker Environments Using Temporal Feature Fusion (2022)0.00
- Baseline Systems For The First Spoofing-aware Speaker Verification Challenge: Score And Embedding Fusion (2022)6.77
- System Combination For Short Utterance Speaker Recognition (2016)5.84
- Joint Optimization Of Speaker And Spoof Detectors For Spoofing-robust Automatic Speaker Verification (2025)0.00
- Factorization Of Discriminatively Trained I-vector Extractor For Speaker Recognition (2019)0.00
- Automatic Quality Assessment For Audio-visual Verification Systems. The Love Submission To NIST SRE Challenge 2019 (2020)0.00
- Application Of ASV For Voice Identification After VC And Duration Predictor Improvement In TTS Models (2024)0.00