Exploring In-context Learning Of Textless Speech Language Model For Speech Classification Tasks
2023 Β· Ming-Hao Hsu, Kai-Wei Chang, Shang-Wen Li, et al.
Abstract
Ever since the development of GPT-3 in the natural language processing (NLP) field, in-context learning (ICL) has played an essential role in utilizing large language models (LLMs). By presenting the LM utterance-label demonstrations at the input, the LM can accomplish few-shot learning without relying on gradient descent or requiring explicit modification of its parameters. This enables the LM to perform various downstream tasks in a black-box manner. Despite the success of ICL in NLP, little work is exploring the possibility of ICL in speech processing. This study is the first work exploring ICL for speech classification tasks with textless speech LM. We first show that the current speech LM lacks the ICL capability. We then perform warmup training on the speech LM, equipping the LM with demonstration learning capability. This paper explores and proposes the first speech LM capable of performing unseen classification tasks in an ICL manner.
Authors
(none)
Tags
Stats
Related papers
- Disentangling The Prosody And Semantic Information With Pre-trained Model For In-context Learning Based Zero-shot Voice Conversion (2024)4.52
- End-to-end Speech Recognition Contextualization With Large Language Models (2023)0.00
- Joint Unsupervised And Supervised Learning For Context-aware Language Identification (2023)2.26
- Attention-based Contextual Language Model Adaptation For Speech Recognition (2021)0.00
- Label-context-dependent Internal Language Model Estimation For CTC (2025)0.00
- Bayesian Example Selection Improves In-context Learning For Speech, Text, And Visual Modalities (2024)5.84
- Exploring The Integration Of Large Language Models Into Automatic Speech Recognition Systems: An Empirical Study (2023)8.09
- Clapspeech: Learning Prosody From Text Context With Contrastive Language-audio Pre-training (2023)0.00