← all datasets

text-8

Emerging
2papers using it
15HF downloads
0HF likes
2024first seen

The 'text8' dataset is a benchmark that contains a large corpus of English text used to evaluate generative modeling and sampling strategies in discrete domains.

Papers using text-8 (2)

text-8 β€” datasets β€” generative-models