← all datasets

AlpacaEval

Emerging
1papers using it
2026first seen

AlpacaEval is a benchmark dataset used to evaluate the performance of large language models in terms of their robustness against various attacks.

Papers using AlpacaEval (1)

AlpacaEval β€” datasets β€” cybersecurity