
AlpacaEval 2.0

Emerging

16 papers using it

First seen in 2025

AlpacaEval 2.0 is a benchmark for evaluating how well large language models (LLMs) align with human preferences, based on an analysis of the responses they generate.


Papers using AlpacaEval 2.0 (16)

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs · 2025 · 2 cites

Label-Free Reinforcement Learning via Cross-Model Entropy · 2026

TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection · 2025 · 1 cite

MMoA: An AI-Agent framework with recurrence for Memoried Mixure-of-Agent · 2026

S-SPPO: Semantic-Calibrated Self-Play Preference Optimization · 2026

Aligning Large Language Models via Fully Self-Synthetic Data · 2025

Icon²: Aligning Large Language Models Using Self-Synthetic Preference Data via Inherent Regulation · 2025

SGPO: Self-Generated Preference Optimization based on Self-Improver · 2025

Unlocking Recursive Thinking of LLMs: Alignment via Refinement · 2025

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model · 2025

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge · 2025

Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future · 2025

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs · 2025

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? · 2025

Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis · 2025

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings · 2025
