IFEval

Name: IFEval
License: apache-2.0

Canonical

12papers using it

93,921HF downloads

151HF likes

2024first seen

Dataset Card for IFEval Dataset Summary This dataset contains the prompts used in the Instruction-Following Eval (IFEval) benchmark for large language models. It contains around 500 "verifiable instructions" such as "write in more than 400 words" and "mention the keyword of AI at least 3 times" which can be verified by

🤗 Hugging Face⚖ apache-2.0

Papers using IFEval (12)

PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling2025 · 4 cites

The Price of Format: Diversity Collapse in LLMs2025 · 2 cites

Boosting Large Language Models with Mask Fine-Tuning2025

Revisiting the Reliability of Language Models in Instruction-Following2025

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models2025

Sample, Don't Search: Rethinking Test-Time Alignment for Language Models2025

MM-IFEngine: Towards Multimodal Instruction Following2025

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models2025

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards2025

Effectively Controlling Reasoning Models through Thinking Intervention2025

M-IFEval: Multilingual Instruction-Following Evaluation2025

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following2024 · 4 cites