HealthBench
Emerging7papers using it
2,966HF downloads
147HF likes
2025first seen
Contains the data for the HealthBench eval. For the reference implementation of HealthBench, see OpenAI's simple-evals repo.
π€ Hugging Faceβ mit
Papers using HealthBench (7)
- Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended ReasoningAlternating Reinforcement Learning with Contextual Rubric RewardsRubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine GenerationRubrics as Rewards: Reinforcement Learning Beyond Verifiable DomainsBaichuan-M2: Scaling Medical Capability with Large Verifier SystemDoctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement LearningMultidimensional Rubric-oriented Reward Model Learning via Geometric Projection Reference Constraints