Speechocean-762

Emerging

13papers using it

2022first seen

The 'Speechocean-762' dataset contains 5,000 utterances used to evaluate L2 English pronunciation across multiple aspects, including accuracy, fluency, prosody, and completeness.

🔎 Find this dataset

Papers using Speechocean-762 (13)

A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales2026

Goodness-of-pronunciation without phoneme time alignment2026

Zero-Shot Speech LLMs for Multi-Aspect Evaluation of L2 Speech: Challenges and Opportunities2026

English Pronunciation Evaluation without Complex Joint Training: LoRA Fine-tuned Speech Multimodal LLM2025

CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment2025

Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment2024

Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning2022 · 38 cites

SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation2022 · 2 cites

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment2023

Zero-Shot Automatic Pronunciation Assessment2023

L1-aware Multilingual Mispronunciation Detection Framework2023

Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment2024

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment2022