Chatbot Arena
Canonical8papers using it
2024first seen
Papers using Chatbot Arena (8)
- SCOPE: Selective Conformal Optimized Pairwise LLM JudgingDropping Just a Handful of Preferences Can Change Top Large Language Model RankingsBridging Human and LLM Judgments: Understanding and Narrowing the GapSynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMsSynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward
Models in LLMsDecentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language ModelsInvestigating Non-Transitivity in LLM-as-a-JudgeA Statistical Framework for Ranking LLM-Based Chatbots