Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

upvoted a paper 16 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Organizations

upvoted a paper 16 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published 24 days ago • 78

upvoted a paper 25 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139

upvoted an article about 2 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Oct 24, 2024

upvoted an article 4 months ago

Article

Diffusion Language Models: The New Paradigm

Jun 10, 2025

upvoted a paper 4 months ago

Rope to Nope and Back Again: A New Hybrid Attention Strategy

Paper • 2501.18795 • Published Jan 30, 2025 • 12

upvoted an article 5 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020 • 279

upvoted a paper 5 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14, 2025 • 89

upvoted 2 articles 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 741

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025 • 748

upvoted 4 papers 7 months ago

Don't Pay Attention

Paper • 2506.11305 • Published Jun 12, 2025 • 7

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6, 2025 • 30

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Paper • 2505.17870 • Published May 23, 2025 • 5

upvoted a paper 8 months ago

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Paper • 2312.03209 • Published Dec 6, 2023 • 21

upvoted an article 8 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024 • 751

upvoted a paper 9 months ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14, 2025 • 10

upvoted an article 10 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024 • 428

upvoted a paper 10 months ago

Min P Sampling: Balancing Creativity and Coherence at High Temperature

Paper • 2407.01082 • Published Jul 1, 2024 • 1

upvoted an article 10 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12, 2025 • 480

upvoted a paper 10 months ago

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Paper • 2502.18080 • Published Feb 25, 2025 • 2
