
CoVoST 2

Emerging

28 papers using it

2021 first seen

CoVoST 2 is a multilingual speech-to-text translation dataset, used to evaluate speech translation systems across many language pairs.


Papers using CoVoST 2 (28)

Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison · 2025 · 1 cite

Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation · 2026

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion · 2026

PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs · 2025

SASST: Leveraging Syntax-Aware Chunking and LLMs for Simultaneous Speech Translation · 2025

Speech Translation Refinement using Large Language Models · 2025

MAESTRO: Matched Speech Text Representations through Modality Matching · 2022 · 71 cites

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training · 2021 · 50 cites

Pre-training for Speech Translation: CTC Meets Optimal Transport · 2023 · 7 cites

Simple and Effective Unsupervised Speech Translation · 2022 · 2 cites

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation · 2023 · 2 cites

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation · 2023 · 2 cites

Compact Speech Translation Models via Discrete Speech Units Pretraining · 2024 · 2 cites

Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation · 2022 · 1 cite

Towards a Deep Understanding of Multilingual End-to-End Speech Translation · 2023 · 1 cite

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models · 2024 · 1 cite

CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation · 2024 · 1 cite

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale · 2021

Improved Cross-Lingual Transfer Learning For Automatic Speech Translation · 2023

Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts · 2023

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators · 2024

Investigating Decoder-only Large Language Models for Speech-to-text Translation · 2024

Task Arithmetic for Language Expansion in Speech Translation · 2024

Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning · 2024

Representation Purification for End-to-End Speech Translation · 2024

Zero-resource Speech Translation and Recognition with LLMs · 2024

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation · 2022

Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation · 2022
