Low-resource Contextual Topic Identification On Speech
2018 Β· Chunxi Liu, Matthew Wiesner, Shinji Watanabe, et al.
Abstract
In topic identification (topic ID) on real-world unstructured audio, an audio instance of variable topic shifts is first broken into sequential segments, and each segment is independently classified. We first present a general purpose method for topic ID on spoken segments in low-resource languages, using a cascade of universal acoustic modeling, translation lexicons to English, and English-language topic classification. Next, instead of classifying each segment independently, we demonstrate that exploring the contextual dependencies across sequential segments can provide large improvements. In particular, we propose an attention-based contextual model which is able to leverage the contexts in a selective manner. We test both our contextual and non-contextual models on four LORELEI languages, and on all but one our attention-based contextual model significantly outperforms the context-independent models.
Authors
(none)
Tags
Stats
Related papers
- Topic Identification For Speech Without ASR (2017)7.16
- Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information (2023)4.52
- Joint Unsupervised And Supervised Learning For Context-aware Language Identification (2023)2.26
- Attention-based Contextual Language Model Adaptation For Speech Recognition (2021)0.00
- Intent Recognition And Unsupervised Slot Identification For Low Resourced Spoken Dialog Systems (2021)2.26
- Spoken Term Detection Methods For Sparse Transcription In Very Low-resource Settings (2021)0.00
- Audio-based Linguistic Feature Extraction For Enhancing Multi-lingual And Low-resource Text-to-speech (2024)0.00
- Attention And DCT Based Global Context Modeling For Text-independent Speaker Recognition (2022)7.50