Cross-domain Adaptation Of Spoken Language Identification For Related Languages: The Curious Case Of Slavic Languages
2020 · Badr M. Abdullah, Tania Avgustinova, Bernd Möbius, et al.
Abstract
State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language. However, it is still unclear to what extent neural LID models generalize to speech samples with different acoustic conditions due to domain shift. In this paper, we present a set of experiments to investigate the impact of domain mismatch on the performance of neural LID systems for a subset of six Slavic languages across two domains (read speech and radio broadcast) and examine two low-level signal descriptors (spectral and cepstral features) for this task. Our experiments show that (1) out-of-domain speech samples severely hinder the performance of neural LID models, and (2) while both spectral and cepstral features show comparable performance within-domain, spectral features show more robus
Authors
(none)
Tags
Stats
Related papers
- Domain Attentive Fusion For End-to-end Dialect Identification With Unknown Target Domain (2018)0.00
- Investigating The Impact Of Cross-lingual Acoustic-phonetic Similarities On Multilingual Speech Recognition (2022)3.58
- Unsupervised Neural Adaptation Model Based On Optimal Transport For Spoken Language Identification (2020)8.82
- Cross-corpora Language Recognition: A Preliminary Investigation With Indian Languages (2021)6.77
- Source -free Domain Adaptation For Speaker Verification In Data-scarce Languages And Noisy Channels (2024)0.00
- Speaker Verification Using End-to-end Adversarial Language Adaptation (2018)11.19
- Neural Domain Alignment For Spoken Language Recognition Based On Optimal Transport (2023)0.00
- Enhancing Neural Spoken Language Recognition: An Exploration With Multilingual Datasets (2025)0.00