#ModelWERPaper
1Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR2.80β€”
2Language model integration based on memory control for sequence to sequence speech recognition3.70β€”
3Language Model Integration Based On Memory Control For Sequence To Sequence Speech Recognition3.70β€”
4Whisfusion: Parallel ASR Decoding via a Diffusion Transformer8.30β€”
5Segaug: Ctc-aligned Segmented Augmentation For Robust Rnn-transducer Based Speech Recognition12.50β€”
6SegAug: CTC-Aligned Segmented Augmentation For Robust RNN-Transducer Based Speech Recognition12.50β€”
7Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering27.73β€”
Librispeech librispeech Leaderboard