τ-Bench

Emerging

5 papers using it

First seen: 2025

Ψ-Bench is a benchmark designed to evaluate the ability of language models to influence users through persuasive dialogues, incorporating personalized user profiles derived from dialogue histories.


Papers using τ-Bench (5)

Goal Alignment in LLM-Based User Simulators for Conversational AI · 2025 · 14 cites

Reinforcement World Model Learning for LLM-based Agents · 2026

LRanker: LLM Ranker for Massive Candidates · 2026

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues · 2026

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs · 2025