Cybench

Emerging

3 papers using it

First seen: 2025

Cybench is a benchmark that evaluates how well language model agents find vulnerabilities, using a set of challenges that test their capabilities on software security tasks.

Papers using Cybench (3)

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo (2025)

Cyber-Zero: Training Cybersecurity Agents without Runtime (2025)

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo (2025)