← all datasets

Nemotron-CC

Emerging
2papers using it
20,598HF downloads
124HF likes
2025first seen

Nemotron-Pre-Training-Dataset-v1 Release Data Overview This pretraining dataset, for generative AI model training, preserves high-value math and code while enriching it with diverse multilingual Q&A, fueling the next generation of intelligent, globally-capable models. This dataset supports NVIDIA Nemotron Nano 2, a fam

Papers using Nemotron-CC (2)

Nemotron-CC β€” datasets β€” llm-papers