← all datasets

FLEURS

Canonical

52papers using it

2022first seen

FLEURS is a benchmark used to evaluate speech-to-text translation performance across multiple languages, including the assessment of translation quality in the context of the KUTED dataset for Central Kurdish.

🔎 Find this dataset

Papers using FLEURS (52)

FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System2026

Whispering in Amharic: Fine-tuning Whisper for Low-resource Language2025 · 7 cites

Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages2025 · 5 cites

Language-Aware Prompt Tuning for Parameter-Efficient Seamless Language Expansion in Multilingual ASR2025 · 4 cites

Bandwidth-Efficient and Privacy-Preserving Edge-Cloud Many-to-Many Speech Translation2026

Using Songs to Improve Kazakh Automatic Speech Recognition2026

Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition2025 · 1 cites

SN-WER: Script-Normalized WER for Multi-Script Indic ASR Evaluation2026

Benchmarking Speech-to-Speech Translation Models2026

A Comparative Study of Pre-trained Speech Encoders and Training Objectives for Large-Scale Indic Spoken Language Identification2026

PiDA: Phonetically-Informed Data Augmentation for Robust Vietnamese Speech Translation2026

Sometin Beta Pass Notin (SBPN): Improving Multilingual ASR for Nigerian Languages via Knowledge Distillation2026

English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization2026

Script Collapse in Multilingual ASR: Defining and Measuring Script Fidelity Rate2026

VietSuperSpeech: A Large-Scale Vietnamese Conversational Speech Dataset for ASR Fine-Tuning in Chatbot, Customer Support, and Call Center Applications2026

Two-Stage Adaptation for Non-Normative Speech Recognition: Revisiting Speaker-Independent Initialization for Personalization2026

FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish2026

MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages2025

POTSA: A Cross-Lingual Speech Alignment Framework for Speech-to-Text Translation2025

PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs2025

DQLoRA: A Lightweight Domain-Aware Denoising ASR via Adapter-guided Distillation2025

Training-Free Voice Conversion with Factorized Optimal Transport2025

Improving Language and Modality Transfer in Translation by Character-level Modeling2025

On the use of Performer and Agent Attention for Spoken Language Identification2025

AfriHuBERT: A self-supervised speech representation model for African languages2024

Scaling Speech Technology to 1,000+ Languages2023 · 116 cites

Improving Massively Multilingual ASR With Auxiliary CTC Objectives2023 · 26 cites

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech2022 · 15 cites

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation2023 · 13 cites

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study2024 · 7 cites

A Compact End-to-End Model with Local and Global Context for Spoken Language Identification2022 · 4 cites

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations2022 · 4 cites

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement2024 · 2 cites

CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation2024 · 1 cites

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR2022

Label Aware Speech Representation Learning For Language Identification2023

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer2023

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation2023

Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification2023

Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges2024

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators2024

Africa-Centric Self-Supervised Pre-Training for Multilingual Speech Representation in a Sub-Saharan Context2024

ASTRA: Aligning Speech and Text Representations for Asr without Sampling2024

Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR2024

Investigating Decoder-only Large Language Models for Speech-to-text Translation2024

FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks2024

ASR Benchmarking: Need for a More Representative Conversational Dataset2024

EMMeTT: Efficient Multimodal Machine Translation Training2024

Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking2024

Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning2024

DENOASR: Debiasing ASRs through Selective Denoising2024

Whisper Finetuning on Nepali Language2024

FLEURS dataset — papers, benchmarks & downloads · Speech Audio