Bridging The Gap Between Clean Data Training And Real-world Inference For Spoken Language Understanding
2021 Β· di Wu, Yiren Chen, Liang Ding, et al.
Abstract
Spoken language understanding (SLU) system usually consists of various pipeline components, where each component heavily relies on the results of its upstream ones. For example, Intent detection (ID), and slot filling (SF) require its upstream automatic speech recognition (ASR) to transform the voice into text. In this case, the upstream perturbations, e.g. ASR errors, environmental noise and careless user speaking, will propagate to the ID and SF models, thus deteriorating the system performance. Therefore, the well-performing SF and ID models are expected to be noise resistant to some extent. However, existing models are trained on clean data, which causes a \textit\{gap between clean data training and real-world inference.\} To bridge the gap, we propose a method from the perspective of domain adaptation, by which both high- and low-quality samples are embedding into similar vector space. Meanwhile, we design a denoising generation model to reduce the impact of the low-quality sampl
Authors
(none)
Tags
Stats
Related papers
- Large-scale Transfer Learning For Low-resource Spoken Language Understanding (2020)2.26
- Data Augmentation For Spoken Language Understanding Via Pretrained Language Models (2020)0.00
- Learning From Multiple Noisy Augmented Data Sets For Better Cross-lingual Spoken Language Understanding (2021)3.58
- AFD-SLU: Adaptive Feature Distillation For Spoken Language Understanding (2025)0.00
- Recent Advances In End-to-end Spoken Language Understanding (2019)8.09
- Integrating Pretrained ASR And LM To Perform Sequence Generation For Spoken Language Understanding (2023)5.24
- Do As I Mean, Not As I Say: Sequence Loss Training For Spoken Language Understanding (2021)6.77
- Style Attuned Pre-training And Parameter Efficient Fine-tuning For Spoken Language Understanding (2020)6.77