Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted an article about 12 hours ago

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

updated a Space about 14 hours ago

lewtun/trl-trackio

published a Space about 14 hours ago

lewtun/trl-trackio

View all activity

Organizations

upvoted an article about 12 hours ago

Article

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

By

•

Sep 16

• 7

upvoted a paper about 22 hours ago

An efficient probabilistic hardware architecture for diffusion-like models

Paper • 2510.23972 • Published 8 days ago • 3

upvoted a paper 5 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 6 days ago • 39

upvoted an article 5 days ago

Article

3+ Years of ML & Society at Hugging Face 🤗🤝🧑‍🤝‍🧑

By

and 3 others •

7 days ago

• 13

upvoted 2 articles 6 days ago

Article

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

9 days ago

• 53

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

By

•

6 days ago

• 21

upvoted a collection 7 days ago

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated 7 days ago • 55

upvoted 2 papers 7 days ago

Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs

Paper • 2402.12030 • Published Feb 19, 2024 • 3

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 246

upvoted a paper 8 days ago

Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine

Paper • 2510.21614 • Published 12 days ago • 19

upvoted an article 9 days ago

Article

Streaming datasets: 100x More Efficient

9 days ago

• 47

upvoted 2 papers 9 days ago

Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 17

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 24

upvoted a paper 10 days ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 49

upvoted a paper 12 days ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 72

upvoted a changelog 12 days ago

Changelog

Cleaner Collection URLs

13 days ago

• 65

upvoted an article 13 days ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

13 days ago

• 115

upvoted 2 papers 13 days ago

Bridging Offline and Online Reinforcement Learning for LLMs

Paper • 2506.21495 • Published Jun 26 • 3

CWM: An Open-Weights LLM for Research on Code Generation with World Models

Paper • 2510.02387 • Published Sep 30 • 7

upvoted a collection 13 days ago

Environment Hub

A collection of OpenEnv-spec Environments • 5 items • Updated 13 days ago • 10