text-8
Emerging2papers using it
15HF downloads
0HF likes
2024first seen
The 'text8' dataset is a benchmark that contains a large corpus of English text used to evaluate generative modeling and sampling strategies in discrete domains.
The 'text8' dataset is a benchmark that contains a large corpus of English text used to evaluate generative modeling and sampling strategies in discrete domains.