Dolly
Emerging1papers using it
2026first seen
The 'Dolly' dataset is a benchmark used to evaluate the safety and compliance behavior of large language models (LLMs) during benign instruction fine-tuning.
The 'Dolly' dataset is a benchmark used to evaluate the safety and compliance behavior of large language models (LLMs) during benign instruction fine-tuning.