HealthBench-Hard
Emerging3papers using it
92HF downloads
0HF likes
2025first seen
HealthBench-Hard is a benchmark used to evaluate the alignment of large language models with clinician preferences in healthcare contexts.
π€ Hugging Faceβ mit
HealthBench-Hard is a benchmark used to evaluate the alignment of large language models with clinician preferences in healthcare contexts.