
GPQA

Emerging
25 papers using it
114,728 HF downloads
461 HF likes
2025 first seen

Dataset Card for GPQA

GPQA is a multiple-choice Q&A dataset of very hard questions written and validated by experts in biology, physics, and chemistry. When attempting questions outside their own domain (e.g., a physicist answering a chemistry question), these experts reach only 34% accuracy, despite spending more than 30 minutes with ful

Papers using GPQA (25)
