Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Finnish-NLP
's Collections
Nordic datasets with Fineweb-edu predictions
Continued pretrain research datasets
Ahma models
Finnish Wav2vec2-xlsr speech recognition
Finnish Whisper speech recognition
Finnish pretrain datasets
Finnish SFT/DPO dataset
Finnish-Fineweb-edu
Finnish LLama models
Instruction tuned models
Finnish pretrain datasets
updated
Oct 4
Upvote
-
Finnish-NLP/mc4_fi_cleaned
Viewer
•
Updated
Oct 21, 2022
•
18.1M
•
155
•
4
Finnish-NLP/Reddit_fi_2006_2022
Viewer
•
Updated
Nov 26, 2023
•
4.52M
•
56
•
2
Finnish-NLP/wikipedia_20230501_fi_cleaned
Viewer
•
Updated
May 18, 2023
•
411k
•
34
Finnish-NLP/oscar_2301_fi_cleaned
Viewer
•
Updated
May 19, 2023
•
5.23M
•
218
Finnish-NLP/HPLT_1.2_fi_cleaned
Viewer
•
Updated
Mar 1, 2024
•
5.11M
•
199
Finnish-NLP/CulturaX_fi_cleaned
Viewer
•
Updated
Dec 23, 2023
•
28.8M
•
177
Finnish-NLP/Fineweb2_Finnish_fineweb_edu_predicted
Viewer
•
Updated
Jun 5
•
33.2M
•
355
Finnish-NLP/finepdf_fi_edu_score_topic_classified
Viewer
•
Updated
Sep 14
•
1.98M
•
58
Upvote
-
Share collection
View history
Collection guide
Browse collections