custom Wikipedia text corpus
Emerging1papers using it
2025first seen
The 'custom Wikipedia text corpus' is a dataset derived from Wikipedia that is used to evaluate language modeling techniques, specifically for predicting the next word based on preceding context.