Context-aware Neural-based Dialog Act Classification On Automatically Generated Transcriptions
2019 Β· Daniel Ortega, Chia-Yu Li, Gisela Vallejo, et al.
Abstract
This paper presents our latest investigations on dialog act (DA) classification on automatically generated transcriptions. We propose a novel approach that combines convolutional neural networks (CNNs) and conditional random fields (CRFs) for context modeling in DA classification. We explore the impact of transcriptions generated from different automatic speech recognition systems such as hybrid TDNN/HMM and End-to-End systems on the final performance. Experimental results on two benchmark datasets (MRDA and SwDA) show that the combination CNN and CRF improves consistently the accuracy. Furthermore, they show that although the word error rates are comparable, End-to-End ASR system seems to be more suitable for DA classification.
Authors
(none)
Tags
Stats
Related papers
- Exploring Textual And Speech Information In Dialogue Act Classification With Speaker Domain Adaptation (2018)0.00
- Speaker Conditioned Acoustic Modeling For Multi-speaker Conversational ASR (2021)4.52
- Domain Adversarial Neural Networks For Dysarthric Speech Recognition (2020)7.50
- Using Deep Learning Techniques And Inferential Speech Statistics For AI Synthesised Speech Recognition (2021)0.00
- Advancing CTC-CRF Based End-to-end Speech Recognition With Wordpieces And Conformers (2021)0.00
- Towards A Competitive End-to-end Speech Recognition For Chime-6 Dinner Party Transcription (2020)6.77
- Audio-attention Discriminative Language Model For ASR Rescoring (2019)9.23
- Performance Improvements Of Probabilistic Transcript-adapted ASR With Recurrent Neural Network And Language-specific Constraints (2016)0.00