
WMT-24

Emerging
3 papers using it
2025 first seen

The WMT-24++ dataset is a benchmark containing translations across 54 language pairs. It is used to evaluate the literality of both human translations and machine translation output, including system performance in direct translation, iterative self-revision, and post-editing tasks.

Papers using WMT-24 (3)
