MuST-C
Emerging56papers using it
2022first seen
MuST-C is a dataset used to evaluate simultaneous speech-to-speech translation across multiple languages, focusing on the quality-latency trade-off in long-form continuous speech scenarios.
Papers using MuST-C (53)
- STEMM: Self-learning With Speech-text Manifold Mixup For Speech TranslationSpeechut: Bridging Speech And Text With Hidden-unit For Encoder-decoder Based Speech-text Pre-trainingEfficient Sequence Transduction By Jointly Predicting Tokens And DurationsSimulU: Training-free Policy for Long-form Simultaneous Speech-to-Speech TranslationRethinking And Improving Multi-task Learning For End-to-end Speech TranslationAn Empirical Study Of Consistency Regularization For End-to-end Speech-to-text TranslationBridging The Gaps Of Both Modality And Language: Synchronous Bilingual CTC For Speech Translation And Speech RecognitionOptimal Multi-Task Learning at Regularization Horizon for Speech Translation TaskOptimizing Speech Multi-view Feature Fusion Through Conditional ComputationInfiniSST: Simultaneous Translation of Unbounded Speech with Large Language ModelOptimizing Speech Multi-View Feature Fusion through Conditional
ComputationSpeech Translation Refinement using Large Language ModelsImproving End-to-end Speech Translation By Imitation-based Knowledge Distillation With Synthetic TranscriptsAdatrans: Adapting With Boundary-based Shrinking For End-to-end Speech TranslationImplicit Memory Transformer For Computationally Efficient Simultaneous Speech TranslationSpeech Translation With Foundation Models And Optimal Transport: UPC At IWSLT23Shiftable Context: Addressing Training-inference Context Mismatch In Simultaneous Speech TranslationM3ST: Mix at Three Levels for Speech TranslationPre-training for Speech Translation: CTC Meets Optimal TransportEfficient Sequence Transduction by Jointly Predicting Tokens and
DurationsSpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder
Based Speech-Text Pre-trainingSimple and Effective Unsupervised Speech TranslationAdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech
TranslationWACO: Word-Aligned Contrastive Learning for Speech TranslationTuning Large language model for End-to-end Speech TranslationEfficient Speech Translation with Dynamic Latent PerceiversSegAugment: Maximizing the Utility of Speech Translation Data with
Segmentation-based AugmentationsHybrid Transducer and Attention based Encoder-Decoder Modeling for
Speech-to-Text TasksUnderstanding and Bridging the Modality Gap for Speech TranslationCMOT: Cross-modal Mixup via Optimal Transport for Speech TranslationSoft Alignment of Modality Space for End-to-end Speech TranslationFASST: Fast LLM-based Simultaneous Speech TranslationGenerating Synthetic Speech from SpokenVocab for Speech TranslationRedApt: An Adaptor for wav2vec 2 Encoding \\ Faster and Smaller Speech
Translation without Quality CompromiseDecouple Non-parametric Knowledge Distillation For End-to-end Speech
TranslationImproving speech translation by fusing speech and textCTC-based Non-autoregressive Speech TranslationSpeech Translation with Foundation Models and Optimal Transport: UPC at
IWSLT23Modality Adaption or Regularization? A Case Study on End-to-End Speech
TranslationShiftable Context: Addressing Training-Inference Context Mismatch in
Simultaneous Speech TranslationImplicit Memory Transformer for Computationally Efficient Simultaneous
Speech TranslationImproving End-to-End Speech Translation by Imitation-Based Knowledge
Distillation with Synthetic TranscriptsAn Empirical Study of Consistency Regularization for End-to-End
Speech-to-Text TranslationBridging the Gaps of Both Modality and Language: Synchronous Bilingual
CTC for Speech Translation and Speech RecognitionCross-Modal Multi-Tasking for Speech-to-Text Translation via Hard
Parameter SharingRethinking and Improving Multi-task Learning for End-to-end Speech
TranslationPushing the Limits of Zero-shot End-to-End Speech TranslationSimulTron: On-Device Simultaneous Speech to Speech TranslationTask Arithmetic for Language Expansion in Speech TranslationRepresentation Purification for End-to-End Speech TranslationOn the Impact of Noises in Crowd-Sourced Data for Speech TranslationImproving Speech Translation by Cross-Modal Multi-Grained Contrastive
LearningIncremental Blockwise Beam Search for Simultaneous Speech Translation
with Controllable Quality-Latency Tradeoff