
text-8

Emerging
Papers using it: 1
First seen: 2025

The text-8 dataset is a large corpus of English text used as a benchmark for evaluating language modeling and text processing algorithms.

Papers using text-8 (1)

text-8 - datasets - learning-to-hash