some interesting datasets to use for language modeling
Totally Free + Zero Barriers + No Login Required