← all datasets

Tamil

Emerging
6papers using it
2024first seen

The 'Tamil' dataset/benchmark contains speech data used to evaluate automatic speech recognition (ASR) systems through fine-grained, Part-of-Speech (PoS)-wise error analysis, particularly focusing on the alignment of ASR hypotheses and reference transcriptions in non-Latin scripts.

Papers using Tamil (6)

Tamil β€” datasets β€” speech-audio