
HealthBench

Emerging
2 papers using it
134 HF downloads
5 HF likes
2026 first seen

THE CODE IS CURRENTLY BROKEN, BUT THE DATASET IS GOOD!

HealthBench Implementation for Using Open-Source Judges

Easy-to-use implementation of OpenAI's HealthBench evaluation benchmark, with support for any OpenAI API-compatible model as both the system under test and the judge.

Developed by: Nisten Tahiraj / OnDeviceMednotes
License: MIT
Paper: HealthBench: Evaluating Large Language Models Towards Improved Human Health

Overview

This repository contains tools… See the full description on the dataset page: https://huggingface.co/datasets/OnDeviceMedNotes/healthbench.
