speechocean-762
Emerging12papers using it
2022first seen
The 'speechocean762' dataset is a benchmark used to evaluate Automatic Speech Recognition (ASR) models, containing diverse speech samples for assessing pronunciation quality and feature extraction capabilities.
Papers using speechocean-762 (12)
- Speechblender: Speech Augmentation Framework For Mispronunciation Data GenerationAcoustic Feature Mixup For Balanced Multi-aspect Pronunciation AssessmentGoodness-of-pronunciation without phoneme time alignmentZero-Shot Speech LLMs for Multi-Aspect Evaluation of L2 Speech: Challenges and OpportunitiesEnglish Pronunciation Evaluation without Complex Joint Training: LoRA Fine-tuned Speech Multimodal LLMCBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency AssessmentZero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation AssessmentSpeechBlender: Speech Augmentation Framework for Mispronunciation Data
GenerationA Hierarchical Context-aware Modeling Approach for Multi-aspect and
Multi-granular Pronunciation AssessmentZero-Shot Automatic Pronunciation AssessmentL1-aware Multilingual Mispronunciation Detection FrameworkAcoustic Feature Mixup for Balanced Multi-aspect Pronunciation
Assessment