UTD-CRSS Submission For MGB-3 Arabic Dialect Identification: Front-end And Back-end Advancements On Broadcast Speech
2017 · Ahmet E. Bulut, Qian Zhang, Chunlei Zhang, et al.
Abstract
This study presents the systems submitted by the University of Texas at Dallas, Center for Robust Speech Systems (UTD-CRSS) to the MGB-3 Arabic Dialect Identification (ADI) subtask. The task is to discriminate among five varieties of Arabic: Egyptian, Gulf, Levantine, North African, and Modern Standard Arabic. We develop multiple single systems with different front-end representations and back-end classifiers. At the front-end level, Mel-frequency cepstral coefficients (MFCCs) and two types of bottleneck features (BNF) are studied within an i-Vector framework. At the back-end level, Gaussian back-end (GB) and Generative Adversarial Network (GAN) classifiers are applied alternately. The best submission (contrastive) reaches an accuracy of 76.94% on the ADI subtask by augmenting with a randomly chosen part of the development dataset. Further, with a post-evaluation correction in the submitted system, final accuracy is increase
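To make the back-end idea concrete, here is a minimal sketch of a generic Gaussian back-end over fixed-length utterance vectors such as i-Vectors. It is not the authors' implementation: it assumes one mean per dialect and a single covariance shared across dialects, which is a common textbook form of this classifier. The class name `GaussianBackend`, the vector dimension, and the synthetic data are all hypothetical; only the five-class setup comes from the abstract.

```python
import numpy as np


class GaussianBackend:
    """Per-class means with one shared covariance (illustrative assumption)."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        # One mean vector per class.
        self.means_ = np.stack([X[y == c].mean(axis=0) for c in self.classes_])
        # Pooled within-class covariance, lightly regularised for invertibility.
        centered = X - self.means_[np.searchsorted(self.classes_, y)]
        cov = centered.T @ centered / len(X)
        cov += 1e-6 * np.eye(X.shape[1])
        self.precision_ = np.linalg.inv(cov)
        return self

    def log_likelihoods(self, X):
        # Log-likelihood per class up to a constant that is equal for all
        # classes (because the covariance is shared), so it can be dropped.
        diff = X[:, None, :] - self.means_[None, :, :]
        return -0.5 * np.einsum("nkd,de,nke->nk", diff, self.precision_, diff)

    def predict(self, X):
        return self.classes_[np.argmax(self.log_likelihoods(X), axis=1)]


# Synthetic stand-in for i-Vectors: 5 classes, 50 vectors each, 10 dimensions.
# Real i-Vectors are typically several hundred dimensions; 10 keeps this small.
rng = np.random.default_rng(0)
y = np.repeat(np.arange(5), 50)
true_means = 4.0 * rng.normal(size=(5, 10))
X = true_means[y] + rng.normal(size=(250, 10))

model = GaussianBackend().fit(X, y)
acc = float(np.mean(model.predict(X) == y))
print(f"training accuracy on synthetic data: {acc:.3f}")
```

Because the covariance is shared, the decision rule is linear in the input vector, which is one reason this kind of back-end is a cheap baseline to compare against heavier classifiers.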
Related papers
- MIT-QCRI Arabic Dialect Identification System For The 2017 Multi-genre Broadcast Challenge (2017)
- The MGB-2 Challenge: Arabic Multi-dialect Broadcast Media Recognition (2016)
- LSTM-TDNN With Convolutional Front-end For Dialect Identification In The 2019 Multi-genre Broadcast Challenge (2019)
- Hybrid Deep Learning And Signal Processing For Arabic Dialect Recognition In Low-resource Settings (2025)
- Classifier Ensembles For Dialect And Language Variety Identification (2018)
- Dialectal Coverage And Generalization In Arabic Speech Recognition (2024)
- Unibuckernel Reloaded: First Place In Arabic Dialect Identification For The Second Year In A Row (2018)
- Convolutional Neural Networks And Language Embeddings For End-to-end Dialect Recognition (2018)