← all datasets

RULER

Emerging
3papers using it
1,939HF downloads
8HF likes
2025first seen

This is a synthetic dataset generated using πŸ“ RULER: What’s the Real Context Size of Your Long-Context Language Models?. It can be used to evaluate long-context language models with configurable sequence length and task complexity. Currently, It includes 4 tasks from RULER: QA2 (hotpotqa after adding distracting infor

Papers using RULER (3)

RULER β€” datasets β€” ai-agents