RBench
Emerging
7 papers using it
57 HF downloads
0 HF likes
First seen in 2025
RBench is a benchmark dataset for evaluating ranking performance across a range of tasks and candidate scales. It is designed to assess how well ranking models such as LRanker handle large candidate pools.
Papers using RBench (7)
- Goal Alignment in LLM-Based User Simulators for Conversational AI
- Reinforcement World Model Learning for LLM-based Agents
- PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling
- LRanker: LLM Ranker for Massive Candidates
- CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling
- Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues
- Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs