← all datasets

RMCBench

Emerging
3papers using it
2024first seen

RMCBench is a benchmark consisting of 473 prompts used to evaluate the ability of Large Language Models to resist the generation of malicious code through text-to-code and code-to-code scenarios.

Papers using RMCBench (3)

RMCBench β€” datasets β€” ai-for-code