Dolly

Emerging

1papers using it

2026first seen

The 'Dolly' dataset is a benchmark used to evaluate the safety and compliance behavior of large language models (LLMs) during benign instruction fine-tuning.

🔎 Find this dataset

Papers using Dolly (1)

DataShield: Safety-degrading Data Filtering for LLM Benign Instruction Fine-Tuning2026