Japanese dataset

Emerging

5papers using it

2024first seen

The 'Japanese dataset' is a collection of text data without explicit word boundaries used to evaluate the performance of the CC-G2PnP model in predicting phonemic and prosodic labels.

🔎 Find this dataset

Papers using Japanese dataset (5)

LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context2025 · 3 cites

CC-G2PnP: Streaming Grapheme-to-Phoneme and prosody with Conformer-CTC for unsegmented languages2026

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices2025

Singmos: An Extensive Open-source Singing Voice Dataset For MOS Prediction2024 · 3 cites

Contextualized Automatic Speech Recognition with Dynamic Vocabulary2024