VoxPopuli
Emerging14papers using it
2022first seen
VoxPopuli is a benchmark dataset used to evaluate the performance of Automatic Speech Recognition (ASR) models, specifically in the context of the Polish language.
Papers using VoxPopuli (14)
- Joint Pre-training With Speech And Bilingual Text For Direct Speech To Speech TranslationDynamic Chunk Convolution For Unified Streaming And Non-streaming Conformer ASRQuality of Automatic Speech Recognition -- Polish Language case study -- from Wav2Vec to Scribe ElevenLabsSpeech Vecalign: an Embedding-based Method for Aligning Parallel Speech DocumentsCMU's IWSLT 2025 Simultaneous Speech Translation SystemOn the use of Performer and Agent Attention for Spoken Language
IdentificationTowards Personalization Of CTC Speech Recognition Models With Contextual Adapters And Adaptive BoostingMu$^{2}$SLAM: Multitask, Multilingual Speech and Language ModelsSpeechMatrix: A Large-Scale Mined Corpus of Multilingual
Speech-to-Speech TranslationsJoint Pre-Training with Speech and Bilingual Text for Direct Speech to
Speech TranslationTowards Personalization of CTC Speech Recognition Models with Contextual
Adapters and Adaptive BoostingDynamic Chunk Convolution for Unified Streaming and Non-Streaming
Conformer ASRDENOASR: Debiasing ASRs through Selective DenoisingExploring Capabilities of Monolingual Audio Transformers using Large
Datasets in Automatic Speech Recognition of Czech