Japanese dataset
Emerging
5 papers using it
First seen: 2024
The 'Japanese dataset' is a collection of text without explicit word boundaries, used to evaluate how well the CC-G2PnP model predicts phonemic and prosodic labels.
Papers using Japanese dataset (5)
- LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context
- CC-G2PnP: Streaming Grapheme-to-Phoneme and prosody with Conformer-CTC for unsegmented languages
- Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices
- Singmos: An Extensive Open-source Singing Voice Dataset For MOS Prediction
- Contextualized Automatic Speech Recognition with Dynamic Vocabulary