AlpacaEval
Canonical
10 papers using it
20,423 HF downloads
65 HF likes
2024 first seen
Data for alpaca_eval, which aims to help automatic evaluation of instruction-following models
Hugging Face · License: cc-by-nc-4.0
Papers using AlpacaEval (10)
- Token-weighted Direct Preference Optimization with Attention
- References Improve LLM Alignment in Non-Verifiable Domains
- Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-Tuning
- P3: Prompts Promote Prompting
- Alignment Data Map for Efficient Preference Data Selection and Diagnosis
- Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
- XL-Suite: Cross-Lingual Synthetic Training and Evaluation Data for Open-Ended Generation
- Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
- Investigating Non-Transitivity in LLM-as-a-Judge
- Permutative Preference Alignment from Listwise Ranking of Human Judgments