WMT-24
Emerging3papers using it
2025first seen
The WMT-24++ dataset is a benchmark that contains translations across 54 language pairs and is used to evaluate the literality of human translations and machine translation systems, including their performance in direct translation, iterative self-revision, and post-editing tasks.