End-to-end Speech To Intent Prediction To Improve E-commerce Customer Support Voicebot In Hindi And English
2022 Β· Abhinav Goyal, Anupam Singh, Nikesh Garera
Abstract
Automation of on-call customer support relies heavily on accurate and efficient speech-to-intent (S2I) systems. Building such systems using multi-component pipelines can pose various challenges because they require large annotated datasets, have higher latency, and have complex deployment. These pipelines are also prone to compounding errors. To overcome these challenges, we discuss an end-to-end (E2E) S2I model for customer support voicebot task in a bilingual setting. We show how we can solve E2E intent classification by leveraging a pre-trained automatic speech recognition (ASR) model with slight modification and fine-tuning on small annotated datasets. Experimental results show that our best E2E model outperforms a conventional pipeline by a relative ~27% on the F1 score.
Authors
(none)
Tags
Stats
Related papers
- End-to-end ASR For Code-switched Hindi-english Speech (2019)0.00
- Attention Based End To End Speech Recognition For Voice Search In Hindi And English (2021)6.77
- Leveraging Unpaired Text Data For Training End-to-end Speech-to-intent Systems (2020)0.00
- Exploring Transfer Learning For End-to-end Spoken Language Understanding (2020)5.24
- End-to-end Spoken Language Understanding For Generalized Voice Assistants (2021)6.34
- End-to-end Spoken Language Understanding: Performance Analyses Of A Voice Command Task In A Low Resource Setting (2022)8.35
- An Investigation Of End-to-end Multichannel Speech Recognition For Reverberant And Mismatch Conditions (2019)0.00
- Improving End-to-end Models For Set Prediction In Spoken Language Understanding (2022)0.00