Meta-learning Framework For End-to-end Imposter Identification In Unseen Speaker Recognition
2023 Β· Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose
Abstract
Speaker identification systems are deployed in diverse environments, often different from the lab conditions on which they are trained and tested. In this paper, first, we show the problem of generalization using fixed thresholds (computed using EER metric) for imposter identification in unseen speaker recognition and then introduce a robust speaker-specific thresholding technique for better performance. Secondly, inspired by the recent use of meta-learning techniques in speaker verification, we propose an end-to-end meta-learning framework for imposter detection which decouples the problem of imposter detection from unseen speaker identification. Thus, unlike most prior works that use some heuristics to detect imposters, the proposed network learns to detect imposters by leveraging the utterances of the enrolled speakers. Furthermore, we show the efficacy of the proposed techniques on VoxCeleb1, VCTK and the FFSVC 2022 datasets, beating the baselines by up to 10%.
Authors
(none)
Tags
Stats
Related papers
- Meta-learning For Short Utterance Speaker Recognition With Imbalance Length Pairs (2020)15.61
- Hiddenspeaker: Generate Imperceptible Unlearnable Audios For Speaker Verification System (2024)2.26
- Improved Meta-learning Training For Speaker Verification (2021)4.52
- Neural Scoring: A Refreshed End-to-end Approach For Speaker Recognition In Complex Conditions (2024)0.00
- Improved Relation Networks For End-to-end Speaker Verification And Identification (2022)2.26
- Masked Proxy Loss For Text-independent Speaker Verification (2020)2.26
- Unified Hypersphere Embedding For Speaker Recognition (2018)0.00
- SEEF-ALDR: A Speaker Embedding Enhancement Framework Via Adversarial Learning Based Disentangled Representation (2019)3.58