BigGen Bench

Emerging
3 papers using it
154 HF downloads
17 HF likes
2025 first seen

BIGGEN-Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models

Dataset Description

BIGGEN-Bench (BiG Generation Benchmark) is a comprehensive evaluation benchmark designed to assess the capabilities of large language models (LLMs) across a wide range of tasks. This benchmark focuses on free-form te

Papers using BigGen Bench (3)
