LLMBar
Emerging
2 papers using it
106 HF downloads
4 HF likes
First seen: 2025
LLMBar is a challenging meta-evaluation benchmark designed to test how well an LLM evaluator can discern instruction-following outputs. LLMBar consists of 419 instances, each containing an instruction paired with two outputs: one that faithfully and correctly follows the instruction and one that deviates from it.
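To make the pairwise setup concrete, here is a minimal sketch of how an evaluator could be scored on instances of this shape. It is an illustration under stated assumptions, not the benchmark's official tooling: the `PairwiseInstance` class, its field names, the `label` convention (1 or 2 marks the output that follows the instruction), the `prefer_longer` baseline, and the two toy instances are all hypothetical and are not taken from LLMBar itself.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class PairwiseInstance:
    """One hypothetical entry: an instruction, two candidate outputs, and a label."""
    instruction: str
    output_1: str
    output_2: str
    label: int  # assumed convention: 1 or 2, the output that follows the instruction


# An evaluator takes (instruction, output_1, output_2) and returns 1 or 2.
Evaluator = Callable[[str, str, str], int]


def evaluator_accuracy(instances: Sequence[PairwiseInstance], evaluator: Evaluator) -> float:
    """Fraction of instances where the evaluator picks the labeled output."""
    if not instances:
        raise ValueError("need at least one instance")
    correct = sum(
        1
        for ex in instances
        if evaluator(ex.instruction, ex.output_1, ex.output_2) == ex.label
    )
    return correct / len(instances)


def prefer_longer(instruction: str, output_1: str, output_2: str) -> int:
    """Naive baseline that ignores the instruction and picks the longer output."""
    return 1 if len(output_1) >= len(output_2) else 2


# Toy data invented for this sketch (not drawn from the benchmark).
toy_instances = [
    PairwiseInstance(
        instruction="Reply with the single word 'yes'.",
        output_1="yes",
        output_2="Certainly! I would be happy to help with that request.",
        label=1,
    ),
    PairwiseInstance(
        instruction="List three primary colors.",
        output_1="Blue.",
        output_2="Red, yellow, and blue.",
        label=2,
    ),
]

if __name__ == "__main__":
    print(evaluator_accuracy(toy_instances, prefer_longer))
```

In practice the `evaluator` argument would wrap a call to the LLM being meta-evaluated. The length-based baseline is included only to show that an evaluator swayed by surface features can pick the output that deviates from the instruction.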
Hugging Face | License: MIT