← all datasets

Belebele

Emerging
3papers using it
26,809HF downloads
128HF likes
2024first seen

The Belebele Benchmark for Massively Multilingual NLU Evaluation Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. This dataset enables the evaluation of mono- and multi-lingual models in high-, medium-, and low-resource languages. Each question has four multiple-

Papers using Belebele (3)

Belebele β€” datasets β€” llm-papers