curated dataset of 4,070 packages
Emerging1papers using it
2026first seen
The curated dataset of 4,070 packages contains 3,700 benign and 370 malicious software packages and is used to evaluate the performance of Large Language Models (LLMs) in detecting malicious packages and identifying specific malicious indicators.