LLMBar

Name: LLMBar
License: MIT

Emerging

Papers using it: 2

HF downloads: 106

HF likes: 4

First seen: 2025

LLMBar is a challenging meta-evaluation benchmark designed to test how well an LLM evaluator can discern instruction-following outputs. It consists of 419 instances, each pairing an instruction with two outputs: one that faithfully and correctly follows the instruction and one that deviates from it.
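To make the pairwise format concrete, here is a minimal sketch of how an evaluator could be scored on instances of this shape. The field names (`instruction`, `output_1`, `output_2`, `label`), the `Instance` class, the toy examples, and the length-based judge are all illustrative assumptions, not the dataset's confirmed schema or contents. A real run would replace the judge with a call to the LLM evaluator under test.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Instance:
    # Hypothetical field names; check the dataset itself for the real schema.
    instruction: str
    output_1: str
    output_2: str
    label: int  # 1 or 2: which output actually follows the instruction


# A judge takes (instruction, output_1, output_2) and returns 1 or 2.
Judge = Callable[[str, str, str], int]


def evaluator_accuracy(instances: List[Instance], judge: Judge) -> float:
    """Fraction of instances where the judge picks the labeled output."""
    if not instances:
        raise ValueError("need at least one instance")
    correct = sum(
        1
        for ex in instances
        if judge(ex.instruction, ex.output_1, ex.output_2) == ex.label
    )
    return correct / len(instances)


def longer_output_judge(instruction: str, output_1: str, output_2: str) -> int:
    """Deliberately naive baseline: always prefer the longer output."""
    return 1 if len(output_1) >= len(output_2) else 2


# Toy instances invented for illustration; they are not taken from LLMBar.
toy = [
    Instance(
        "Reply with the single word 'yes'.",
        "yes",
        "Yes, of course! Happy to help with that.",
        1,
    ),
    Instance(
        "List three primary colors.",
        "Colors are nice.",
        "Red, yellow, and blue.",
        2,
    ),
]

accuracy = evaluator_accuracy(toy, longer_output_judge)
print(f"accuracy: {accuracy:.2f}")
```

On these two toy instances the length-based judge is right once and wrong once, which illustrates why a superficial preference (here, verbosity) is not enough when the correct output is the one that follows the instruction.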


Papers using LLMBar (2)

Does the Judge Prefer English? Evaluating Language-Switching Invariance in LLM-as-a-Judge (2026)

Are We on the Right Way to Assessing LLM-as-a-Judge? (2025)