← all datasets

LongBench

Emerging
31papers using it
69,173HF downloads
181HF likes
2024first seen

LongBench is a comprehensive benchmark for multilingual and multi-task purposes, with the goal to fully measure and evaluate the ability of pre-trained language models to understand long text. This dataset consists of twenty different tasks, covering key long-text application scenarios such as multi-document QA, single

Papers using LongBench (29)

LongBench β€” datasets β€” llm-papers