← all datasets

200-task primary dataset

Emerging
1papers using it
2026first seen

The '200-task primary dataset' contains a diverse set of coding tasks used to evaluate the security reliability of large language models in code generation across different programming languages and prompting strategies.

Papers using 200-task primary dataset (1)

200-task primary dataset β€” datasets β€” cybersecurity