End-to-end Named Entity Extraction From Speech
2018 · Sahar Ghannay, Antoine Caubrière, Yannick Estève, et al.
Abstract
Named entity recognition (NER) is one of the spoken language understanding (SLU) tasks and usually extracts semantic information from textual documents. Until now, NER from speech has been performed through a pipeline that first runs automatic speech recognition (ASR) on the audio and then runs NER on the ASR output. This approach has several disadvantages (error propagation, an ASR tuning metric that is sub-optimal with regard to the final task, a reduced search space at the ASR output level...), and more integrated approaches are known to outperform sequential ones when they can be applied. In this paper, we present a first study of an end-to-end approach that extracts named entities directly from speech through a single neural architecture. In this way, ASR and NER can be optimized jointly. Experiments are carried out on easily accessible French data, made up of data distributed in several evaluation campaigns. Experimental results show that this end-to-end approach provides better result
Related papers
- End-to-end Model For Named Entity Recognition From Speech Without Paired Training Data (2022) 6.77
- Recent Advances In End-to-end Spoken Language Understanding (2019) 8.09
- On The Use Of External Data For Spoken Named Entity Recognition (2021) 6.77
- Leveraging Cross-lingual Transfer Learning In Spoken Named Entity Recognition Systems (2023) 0.00
- Where Are We In Semantic Concept Extraction For Spoken Language Understanding? (2021) 5.84
- "I've Heard Of You!": Generate Spoken Named Entity Recognition Data For Unseen Entities (2024) 2.26
- Enriching Under-represented Named-entities To Improve Speech Recognition Performance (2020) 0.00
- Speech To Semantics: Improve ASR And NLU Jointly Via All-neural Interfaces (2020) 9.03