Incremental Layer-wise Self-supervised Learning For Efficient Speech Domain Adaptation On Device
2021 Β· Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, et al.
Abstract
Streaming end-to-end speech recognition models have been widely applied to mobile devices and show significant improvement in efficiency. These models are typically trained on the server using transcribed speech data. However, the server data distribution can be very different from the data distribution on user devices, which could affect the model performance. There are two main challenges for on device training, limited reliable labels and limited training memory. While self-supervised learning algorithms can mitigate the mismatch between domains using unlabeled data, they are not applicable on mobile devices directly because of the memory constraint. In this paper, we propose an incremental layer-wise self-supervised learning algorithm for efficient speech domain adaptation on mobile devices, in which only one layer is updated at a time. Extensive experimental results demonstrate that the proposed algorithm obtains a Word Error Rate (WER) on the target domain \(24.2%\) better than s
Authors
(none)
Tags
Stats
Related papers
- Automatic Data Augmentation For Domain Adapted Fine-tuning Of Self-supervised Speech Representations (2023)0.00
- Efficient Adapter Transfer Of Self-supervised Speech Models For Automatic Speech Recognition (2022)12.68
- How To Learn A New Language? An Efficient Solution For Self-supervised Learning Models Unseen Languages Adaption In Low-resource Scenario (2024)0.00
- Boosting Cross-domain Speech Recognition With Self-supervision (2022)0.00
- Self-supervised Learning For Speech Recognition With Intermediate Layer Supervision (2021)9.41
- Self-supervised Learning Based Domain Adaptation For Robust Speaker Verification (2021)11.49
- Multi-domain Adaptation By Self-supervised Learning For Speaker Verification (2023)0.00
- Fast Contextual Adaptation With Neural Associative Memory For On-device Personalized Speech Recognition (2021)9.76