Open ASR Leaderboard: Towards Reproducible And Transparent Multilingual And Long-form Speech Recognition Evaluation
2025 Β· Vaibhav Srivastav, Steven Zheng, Eric Bezzam, et al.
Abstract
We present the Open ASR Leaderboard, a reproducible benchmarking platform with community contributions from academia and industry. It compares 86 open-source and proprietary systems across 12 datasets, with English short- and long-form and multilingual short-form tracks. We standardize word error rate (WER) and inverse real-time factor (RTFx) evaluation for consistent accuracy-efficiency comparisons across model architectures and toolkits (e.g., ESPNet, NeMo, SpeechBrain, Transformers). We observe that Conformer-based encoders paired with transformer-based decoders achieve the best average WER, while connectionist temporal classification (CTC) and token-and-duration transducer (TDT) decoders offer superior RTFx, making them better suited for long-form and batched processing. All code and dataset loaders are open-sourced to support transparent, extensible evaluation. We present our evaluation methodology to facilitate community-driven benchmarking in ASR and other tasks.
Authors
(none)
Tags
Stats
Related papers
- Speechcolab Leaderboard: An Open-source Platform For Automatic Speech Recognition Evaluation (2024)9.05
- Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR (2020)8.35
- Investigating End-to-end ASR Architectures For Long Form Audio Transcription (2023)6.34
- Lebenchmark: A Reproducible Framework For Assessing Self-supervised Representation Learning From Speech (2021)11.39
- A Comparison Of End-to-end Models For Long-form Speech Recognition (2019)12.93
- A Reference-less Quality Metric For Automatic Speech Recognition Via Contrastive-learning Of A Multi-language Model With Self-supervision (2023)2.51
- Toward Practical Automatic Speech Recognition And Post-processing: A Call For Explainable Error Benchmark Guideline (2024)0.00
- Exploring The Limits Of Decoder-only Models Trained On Public Speech Recognition Corpora (2024)4.52