
LLMBar

Emerging
2 papers using it
106 HF downloads
4 HF likes
2025 first seen

LLMBar is a challenging meta-evaluation benchmark designed to test how well an LLM evaluator can discern instruction-following outputs. It consists of 419 instances, each pairing an instruction with two outputs: one that faithfully and correctly follows the instruction and one that deviates from it.
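A minimal sketch of how an evaluator might be scored on pairwise instances of this shape. The field names (`instruction`, `output_1`, `output_2`, `label`), the `evaluator_accuracy` helper, and the toy instances are illustrative assumptions, not the dataset's actual schema or contents; the length-based judge is a deliberately naive stand-in for a real LLM evaluator.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class PairwiseInstance:
    # Hypothetical field names; the real dataset schema may differ.
    instruction: str
    output_1: str
    output_2: str
    label: int  # 1 or 2: which output faithfully follows the instruction


# A judge takes (instruction, output_1, output_2) and returns 1 or 2.
Judge = Callable[[str, str, str], int]


def evaluator_accuracy(instances: Iterable[PairwiseInstance], judge: Judge) -> float:
    """Fraction of instances where the judge picks the labeled output."""
    items = list(instances)
    if not items:
        raise ValueError("no instances to score")
    correct = sum(
        1
        for ex in items
        if judge(ex.instruction, ex.output_1, ex.output_2) == ex.label
    )
    return correct / len(items)


def longer_output_judge(instruction: str, output_1: str, output_2: str) -> int:
    """Toy baseline that ignores the instruction and prefers the longer output."""
    return 1 if len(output_1) >= len(output_2) else 2


# Made-up instances for illustration only; these are not taken from LLMBar.
toy_instances = [
    PairwiseInstance(
        instruction="Reply with one word: what is the capital of France?",
        output_1="Paris",
        output_2="The capital of France is Paris, a city on the Seine.",
        label=1,
    ),
    PairwiseInstance(
        instruction="List three primary colors.",
        output_1="Red, yellow, and blue.",
        output_2="Red.",
        label=1,
    ),
]

accuracy = evaluator_accuracy(toy_instances, longer_output_judge)
```

On the toy data the length-based judge gets one of the two instances wrong, which illustrates why an evaluator that rewards surface features rather than instruction adherence would score poorly on a benchmark built this way.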

Papers using LLMBar (2)
