Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
appvoid
's Collections
cool spaces
main releases
cool datasets
cool datasets
updated
Nov 15
some interesting datasets to use for language modeling
Upvote
-
appvoid/raw-corpus
Viewer
•
Updated
Feb 23
•
1.6M
•
25
pszemraj/simple_wikipedia
Viewer
•
Updated
Sep 9, 2023
•
238k
•
1.1k
•
7
common-pile/youtube
Viewer
•
Updated
Jun 6
•
1.13M
•
416
•
10
srinivasbilla/self-instruct-base
Viewer
•
Updated
Jan 24, 2023
•
82.6k
•
143
•
5
agentlans/high-quality-english-sentences
Viewer
•
Updated
Oct 1, 2024
•
1.71M
•
8.56k
•
27
agentlans/note-taking-v2
Viewer
•
Updated
Sep 22
•
17.6k
•
45
PleIAs/SYNTH
Viewer
•
Updated
Nov 11
•
68M
•
26.7k
•
209
Upvote
-
Share collection
View history
Collection guide
Browse collections