← all datasets

AlpacaEval 2.0

Emerging
16papers using it
2025first seen

'AlpacaEval 2.0' is a dataset/benchmark used to evaluate the alignment of Large Language Models (LLMs) with human preferences through the analysis of generated responses.

Papers using AlpacaEval 2.0 (16)

AlpacaEval 2.0 β€” datasets β€” llm-papers