Incorporating End-to-end Speech Recognition Models For Sentiment Analysis
2019 Β· Egor Lakomkin, Mohammad Ali Zamani, Cornelius Weber, et al.
Abstract
Previous work on emotion recognition demonstrated a synergistic effect of combining several modalities such as auditory, visual, and transcribed text to estimate the affective state of a speaker. Among these, the linguistic modality is crucial for the evaluation of an expressed emotion. However, manually transcribed spoken text cannot be given as input to a system practically. We argue that using ground-truth transcriptions during training and evaluation phases leads to a significant discrepancy in performance compared to real-world conditions, as the spoken text has to be recognized on the fly and can contain speech recognition mistakes. In this paper, we propose a method of integrating an automatic speech recognition (ASR) output with a character-level recurrent neural network for sentiment recognition. In addition, we conduct several experiments investigating sentiment recognition for human-robot interaction in a noise-realistic scenario which is challenging for the ASR systems. We
Authors
(none)
Tags
Stats
Related papers
- Fusing ASR Outputs In Joint Training For Speech Emotion Recognition (2021)12.61
- CTA-RNN: Channel And Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings For Speech Emotion Recognition (2022)5.84
- ASR And Emotional Speech: A Word-level Investigation Of The Mutual Impact Of Speech And Emotion Recognition (2023)8.82
- Extending Rnn-t-based Speech Recognition Systems With Emotion And Language Classification (2022)4.52
- Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, And Augmenting (2023)0.00
- Integrating Emotion Recognition With Speech Recognition And Speaker Diarisation For Conversations (2023)7.16
- Asr-based Features For Emotion Recognition: A Transfer Learning Approach (2018)9.76
- On The Impact Of Word Error Rate On Acoustic-linguistic Speech Emotion Recognition: An Update For The Deep Learning Era (2021)0.00