The SVASR System For Text-dependent Speaker Verification (TDSV) AAIC Challenge 2024
2024 · Mohammadreza Molavi, Reza Khodadadi
Abstract
This paper introduces an efficient and accurate pipeline for text-dependent speaker verification (TDSV), designed to address the need for high-performance biometric systems. The proposed system incorporates a Fast-Conformer-based ASR module to validate speech content, filtering out Target-Wrong (TW) and Impostor-Wrong (IW) trials. For speaker verification, we propose a feature fusion approach that combines speaker embeddings extracted from wav2vec-BERT and ReDimNet models to create a unified speaker representation. This system achieves competitive results on the TDSV 2024 Challenge test set, with a normalized min-DCF of 0.0452 (rank 2), highlighting its effectiveness in balancing accuracy and robustness.
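The two-stage decision described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the two embeddings are fused or how trials are scored, so the concatenation of L2-normalized embeddings, the cosine scoring, the exact-match text check, and all function names below are assumptions made for illustration only, not the authors' implementation. The ASR transcript and the two speaker embeddings are taken as precomputed inputs.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def fuse(emb_a, emb_b):
    """Assumed fusion rule: concatenate the two L2-normalized embeddings,
    then re-normalize the result. The paper's actual fusion may differ."""
    return l2_normalize(l2_normalize(emb_a) + l2_normalize(emb_b))


def cosine(u, v):
    """Cosine similarity between two vectors."""
    u, v = l2_normalize(u), l2_normalize(v)
    return sum(a * b for a, b in zip(u, v))


def normalize_text(text):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def verify_trial(expected_phrase, transcript, enroll_embs, test_embs,
                 reject_score=-1.0):
    """Score one trial.

    expected_phrase: the pass-phrase the speaker is required to say.
    transcript: ASR output for the test utterance (hypothetical input).
    enroll_embs, test_embs: pairs of embeddings, one per speaker model.
    """
    # Stage 1: content check. A wrong phrase (TW or IW trial) is rejected
    # with the lowest possible score, whoever the speaker is.
    if normalize_text(transcript) != normalize_text(expected_phrase):
        return reject_score
    # Stage 2: speaker check on the fused representation.
    return cosine(fuse(*enroll_embs), fuse(*test_embs))
```

In this sketch a trial whose transcript does not match the expected phrase never reaches the speaker model, which is one simple way to realize the filtering of wrong-phrase trials; a practical system would more likely tolerate small ASR errors, for example with an edit-distance threshold.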