RMCBench
Emerging3papers using it
2024first seen
RMCBench is a benchmark consisting of 473 prompts used to evaluate the ability of Large Language Models to resist the generation of malicious code through text-to-code and code-to-code scenarios.
RMCBench is a benchmark consisting of 473 prompts used to evaluate the ability of Large Language Models to resist the generation of malicious code through text-to-code and code-to-code scenarios.