Collections including paper arxiv:2506.20920

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 75
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27 • 4.48B • 90.6k • 700
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Space • Running • 82
Evaluate multilingual models using FineTasks

NExT-GPT: Any-to-Any Multimodal LLM

Paper • 2309.05519 • Published Sep 11, 2023 • 78
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 35
apple/DCLM-7B

7B • Updated Jul 26, 2024 • 234 • 832
Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8, 2024 • 111

ahmedheakl/resume-atlas

Viewer • Updated Jul 1, 2024 • 13.4k • 213 • 10
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 75
Infinite Dataset Hub

Space • Running • 279
Search and save datasets generated with an LLM in real time
IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Paper • 2509.06652 • Published Sep 8 • 24