Ο-Bench
Emerging | 5 papers using it
First seen: 2025
Ψ-Bench is a benchmark that evaluates how well language models influence users in persuasive dialogues, using personalized user profiles derived from dialogue histories.
Papers using Ο-Bench (5)
- Goal Alignment in LLM-Based User Simulators for Conversational AI
- Reinforcement World Model Learning for LLM-based Agents
- LRanker: LLM Ranker for Massive Candidates
- Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues
- Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs