
EIFBENCH

Emerging
4 papers using it
First seen: 2025

EIFBENCH is a benchmark designed to evaluate large language models on their ability to follow extremely complex instructions in multi-task scenarios with various constraints, reflecting real-world operational environments.

Papers using EIFBENCH (4)
