AlpacaEval
Emerging1papers using it
2026first seen
AlpacaEval is a benchmark dataset used to evaluate the performance of large language models in terms of their robustness against various attacks.
AlpacaEval is a benchmark dataset used to evaluate the performance of large language models in terms of their robustness against various attacks.