Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
minpeter
's Collections
[Dataset] K-Corpus
[Dataset] FineWeb2 Edu Korean
[Model] Very, very small things
[Dataset] Pretrain-corpus
[Model] en-ko trans
[Dataset] Candidate datasets to translate
[Dataset] common-pile korean (Filtered-raw)
[Dataset] PR
[Study] NN MNIST
[Model] FLUX.1 Full Finetuned & Merged
[🛠️] Huggingface Utility
[Dataset] unified standard function calling
[tokenizer] AlternateTokenizer
[Dataset] Function Calling
[Dataset] Pretrain-corpus
updated
Jul 22
Upvote
-
PleIAs/common_corpus
Viewer
•
Updated
Jun 10
•
470M
•
40.6k
•
318
EssentialAI/essential-web-v1.0
Preview
•
Updated
Oct 2
•
28.9k
•
206
HuggingFaceFW/fineweb
Viewer
•
Updated
Jul 11
•
52.5B
•
285k
•
2.45k
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
Jul 11
•
3.5B
•
220k
•
813
HuggingFaceFW/fineweb-2
Viewer
•
Updated
23 days ago
•
4.48B
•
86.6k
•
689
data-is-better-together/fineweb-c
Viewer
•
Updated
Jul 8
•
88.7k
•
1.39k
•
57
allenai/dolmino-mix-1124
Viewer
•
Updated
21 days ago
•
170M
•
19.7k
•
84
allenai/dolma
Updated
Apr 17, 2024
•
1.39k
•
957
allenai/olmo-mix-1124
Viewer
•
Updated
Aug 19
•
621M
•
14.2k
•
80
mlfoundations/dclm-baseline-1.0
Preview
•
Updated
Jul 22, 2024
•
1.15M
•
243
Zyphra/Zyda-2
Preview
•
Updated
Aug 6
•
291k
•
85
Upvote
-
Share collection
View history
Collection guide
Browse collections