VoxLingua-107
Emerging8papers using it
2022first seen
VoxLingua107 is a dataset used to evaluate language identification models, containing diverse spoken samples across multiple languages and dialects.
Papers using VoxLingua-107 (8)
- Pretraining Approaches For Spoken Language Recognition: Taltech Submission To The OLR 2021 ChallengeA Compact End-to-end Model With Local And Global Context For Spoken Language IdentificationJoint Unsupervised And Supervised Learning For Context-aware Language IdentificationA Compact End-to-End Model with Local and Global Context for Spoken
Language IdentificationAccidental Learners: Spoken Language Identification in Multilingual
Self-Supervised ModelsJoint unsupervised and supervised learning for context-aware language
identificationEfficient Spoken Language Recognition via Multilabel ClassificationTowards spoken dialect identification of Irish