VoxCeleb-1
Emerging46papers using it
2022first seen
VoxCeleb1 is a dataset used to evaluate speaker verification systems, containing a diverse collection of utterances from thousands of speakers.
Papers using VoxCeleb-1 (44)
- Why Does Self-supervised Learning For Speech Recognition Benefit Speaker Recognition?Self-supervised Speaker Verification Using Dynamic Loss-gate And Label CorrectionOne-step Knowledge Distillation And Fine-tuning In Using Large Pre-trained Self-supervised Learning Models For Speaker VerificationA Joint Noise Disentanglement And Adversarial Training Framework For Robust Speaker VerificationSpeaker Recognition Using Isomorphic Graph Attention Network Based Pooling On Self-supervised RepresentationFew-shot Speaker Identification Using Depthwise Separable Convolutional Network With Channel AttentionVoiceextender: Short-utterance Text-independent Speaker Verification With Guided Diffusion ModelUniversal Speaker Recognition Encoders For Different Speech Segments DurationReDimNet2: Scaling Speaker Verification via Time-Pooled Dimension ReshapingSpeaker Verification with Speech-Aware LLMs: Evaluation and AugmentationAn Effective Transformer-based Contextual Model And Temporal Gate Pooling For Speaker IdentificationDAME: Duration-Aware Matryoshka Embedding for Duration-Robust Speaker VerificationEffective Modeling of Critical Contextual Information for TDNN-based Speaker VerificationText-Independent Speaker Identification Using Audio Looping With Margin Based Loss FunctionsInvestigation of Zero-shot Text-to-Speech Models for Enhancing Short-Utterance Speaker VerificationWhisper-pmfa: Partial Multi-scale Feature Aggregation For Speaker Verification Using Whisper ModelsDynamic Kernels And Channel Attention For Low Resource Speaker VerificationSpeaker Verification Using Attentive Multi-scale Convolutional Recurrent NetworkECAPA2: A Hybrid Neural Network Architecture And Training Strategy For Robust Speaker EmbeddingsComputing with Hypervectors for Efficient Speaker IdentificationSelf-Supervised Training of Speaker Encoder with Multi-Modal Diverse
Positive PairsDynamic Kernels and Channel Attention for Low Resource Speaker
VerificationSpeaker Recognition Using Isomorphic Graph Attention Network Based
Pooling on Self-Supervised RepresentationAn Effective Transformer-based Contextual Model and Temporal Gate
Pooling for Speaker IdentificationWhisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker
Verification using Whisper ModelsText-To-Speech Synthesis In The WildSelf-Supervised Speaker Verification Using Dynamic Loss-Gate and Label
CorrectionNon-Contrastive Self-Supervised Learning of Utterance-Level Speech
RepresentationsMeWEHV: Mel and Wave Embeddings for Human Voice TasksUniversal speaker recognition encoders for different speech segments
durationIncorporating Uncertainty from Speaker Embedding Estimation to Speaker
VerificationImproving Speaker Verification with Self-Pretrained Transformer ModelsOne-Step Knowledge Distillation and Fine-Tuning in Using Large
Pre-Trained Self-Supervised Learning Models for Speaker VerificationSpeaker verification using attentive multi-scale convolutional recurrent
networkExperimenting with Additive Margins for Contrastive Self-Supervised Speaker VerificationAsymmetric Clean Segments-Guided Self-Supervised Learning for Robust
Speaker VerificationVoiceExtender: Short-utterance Text-independent Speaker Verification
with Guided Diffusion ModelA New Perspective on Speaker Verification: Joint Modeling with DFSMN and
TransformerECAPA2: A Hybrid Neural Network Architecture and Training Strategy for
Robust Speaker EmbeddingsEfficient Adapter Tuning of Pre-trained Speech Models for Automatic
Speaker VerificationIntegrated Multi-Level Knowledge Distillation for Enhanced Speaker
VerificationCA-SSLR: Condition-Aware Self-Supervised Learning Representation for
Generalized Speech ProcessingFew-Shot Speaker Identification Using Lightweight Prototypical Network
with Feature Grouping and InteractionA Joint Noise Disentanglement and Adversarial Training Framework for
Robust Speaker Verification