← all datasets

ProfBench

Emerging
2papers using it
597HF downloads
29HF likes
2025first seen

Dataset Description: Leaderboard | Blog | Paper | Data | Code | Nemo Evaluator SDK More than 3000 rubric-response pairs across 40 human-annotated tasks presenting reports addressing professional tasks across PhD STEM (Chemistry, Physics) and Professional Services (Financial Services, Management Consulting) domains. Thi

Papers using ProfBench (2)

ProfBench β€” datasets β€” ai-agents