Omkar Pangarkar

omkarenator

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 715

liked a Space 5 days ago

1.68k

The Smol Training Playbook: The Secrets to Building World-Class LLMs

📝

liked a dataset 6 days ago

bigcode/the-stack-github-issues

Viewer • Updated Mar 20, 2023 • 31M • 326 • 47

upvoted a paper 6 days ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 149

upvoted a collection about 1 month ago

The Ultimate Collection of Code Classifiers

Collection

🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated May 5 • 15

upvoted a paper 3 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 46

upvoted an article 4 months ago

Article

nanoJAXGPT: A pedagogical introduction to JAX/Equinox

and 2 others •

Oct 23, 2024

• 5

liked a Space 6 months ago

Predict Memory

🧮

Calculate memory usage for model configurations

upvoted a paper 7 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 93

liked a dataset 7 months ago

WebOrganizer/Corpus-200B

Preview • Updated Feb 19 • 11.7k • 10

liked a Space 7 months ago

125

TxT360: Trillion Extracted Text

📖

Explore and utilize a large, deduplicated text dataset for LLM training

liked a model 9 months ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 27

liked a Space 9 months ago

3.45k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in LLM360/TxT360 9 months ago

fix-deps

#7 opened 9 months ago by

omkarenator

updated a Space 9 months ago

125

TxT360: Trillion Extracted Text

📖

Explore and utilize a large, deduplicated text dataset for LLM training

New activity in LLM360/TxT360 9 months ago

code-formatting

#6 opened 9 months ago by

omkarenator

liked a Space 9 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

upvoted an article 9 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 884

New activity in LLM360/TxT360 about 1 year ago

Add citations and other fixes

#4 opened about 1 year ago by

omkarenator

liked a dataset about 1 year ago

LLM360/TxT360

Updated May 26 • 53.5k • 240
