Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

liked a model 2 days ago

kernels-community/vllm-flash-attn3

liked a model 2 days ago

moonshotai/Kimi-K2-Thinking

upvoted an article 4 days ago

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

View all activity

Organizations

liked 2 models 2 days ago

kernels-community/vllm-flash-attn3

Updated 12 days ago • 32

moonshotai/Kimi-K2-Thinking

Text Generation • Updated about 8 hours ago • 12.5k • • 651

upvoted an article 4 days ago

Article

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

By

•

Sep 16

• 11

updated a Space 4 days ago

Trl Trackio

Display tracking information

published a Space 4 days ago

Trl Trackio

Display tracking information

upvoted a paper 4 days ago

An efficient probabilistic hardware architecture for diffusion-like models

Paper • 2510.23972 • Published 12 days ago • 3

upvoted a paper 8 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 10 days ago • 40

updated a Space 8 days ago

The Smol Training Playbook: The Secrets to Building World-Class LLMs

upvoted an article 9 days ago

Article

3+ Years of ML & Society at Hugging Face 🤗🤝🧑‍🤝‍🧑

By

and 3 others •

10 days ago

• 13

liked a Space 9 days ago

ML & Society at HF

🤗 machine learning and society team website

upvoted an article 9 days ago

Article

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

13 days ago

• 61

liked 2 Spaces 9 days ago

Smol Training Playbook - Table of Contents

The Smol Training Playbook: The Secrets to Building World-Class LLMs

published a Space 9 days ago

The Smol Training Playbook: The Secrets to Building World-Class LLMs

upvoted an article 9 days ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

By

•

9 days ago

• 21

published a dataset 9 days ago

HuggingFaceTB/OpenR1-Math-220k-default-verified

Viewer • Updated Oct 7 • 105k • 351

liked a dataset 9 days ago

neulab/agent-data-collection

Viewer • Updated Sep 9 • 225k • 6.65k • 78

liked a Space 10 days ago

Unlocking On-Policy Distillation for Any Model Family

upvoted a collection 10 days ago

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated 10 days ago • 56

upvoted a paper 10 days ago

Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs

Paper • 2402.12030 • Published Feb 19, 2024 • 3