Semantic Communications For Speech Recognition
2021 Β· Zhenzi Weng, Zhijin Qin, Geoffrey Ye Li
Abstract
The traditional communications transmit all the source data represented by bits, regardless of the content of source and the semantic information required by the receiver. However, in some applications, the receiver only needs part of the source data that represents critical semantic information, which prompts to transmit the application-related information, especially when bandwidth resources are limited. In this paper, we consider a semantic communication system for speech recognition by designing the transceiver as an end-to-end (E2E) system. Particularly, a deep learning (DL)-enabled semantic communication system, named DeepSC-SR, is developed to learn and extract text-related semantic features at the transmitter, which motivates the system to transmit much less than the source speech data without performance degradation. Moreover, in order to facilitate the proposed DeepSC-SR for dynamic channel environments, we investigate a robust model to cope with various channel environments
Authors
(none)
Tags
Stats
Related papers
- Semantic Communications For Speech Signals (2020)14.35
- Deep Learning Enabled Semantic Communications With Speech Recognition And Synthesis (2022)17.85
- Rate-adaptive Coding Mechanism For Semantic Communications With Multi-modal Data (2023)11.93
- Robust Semantic Communications For Speech Transmission (2024)0.00
- From Audio To Semantics: Approaches To End-to-end Spoken Language Understanding (2018)13.23
- An Investigation Of End-to-end Multichannel Speech Recognition For Reverberant And Mismatch Conditions (2019)0.00
- Label-synchronous Neural Transducer For Adaptable Online E2E Speech Recognition (2023)3.58
- Knowledge Transfer From Large-scale Pretrained Language Models To End-to-end Speech Recognizers (2022)9.41