Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2509.26328

Efficient Diffusion LLM

Efficient-Large-Model/Fast_dLLM_v2_1.5B

2B • Updated Oct 11 • 7.69k • 5
Efficient-Large-Model/Fast_dLLM_v2_7B

333k • Updated Oct 11 • 7.64k • 12
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 44

Diffusion Language Models

Structured Denoising Diffusion Models in Discrete State-Spaces

Paper • 2107.03006 • Published Jul 7, 2021 • 1
Simplified and Generalized Masked Diffusion for Discrete Data

Paper • 2406.04329 • Published Jun 6, 2024 • 8
Simple and Effective Masked Diffusion Language Models

Paper • 2406.07524 • Published Jun 11, 2024 • 12
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video

Paper • 2510.05560 • Published Oct 7 • 7
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7 • 63
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 483
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73
MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 97
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published Oct 16 • 38
Attention Sinks in Diffusion Language Models

Paper • 2510.15731 • Published Oct 17 • 48
Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 15 days ago • 116

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 483
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48
Improving Context Fidelity via Native Retrieval-Augmented Reasoning

Paper • 2509.13683 • Published Sep 17 • 8
Multimodal Iterative RAG for Knowledge-Intensive Visual Question Answering

Paper • 2509.00798 • Published Aug 31 • 1

Why mask diffusion does not work

Paper • 2510.03289 • Published Sep 29
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52

Efficient Diffusion LLM

Efficient-Large-Model/Fast_dLLM_v2_1.5B

2B • Updated Oct 11 • 7.69k • 5
Efficient-Large-Model/Fast_dLLM_v2_7B

333k • Updated Oct 11 • 7.64k • 12
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 44

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published Oct 16 • 38
Attention Sinks in Diffusion Language Models

Paper • 2510.15731 • Published Oct 17 • 48
Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 15 days ago • 116

Diffusion Language Models

Structured Denoising Diffusion Models in Discrete State-Spaces

Paper • 2107.03006 • Published Jul 7, 2021 • 1
Simplified and Generalized Masked Diffusion for Discrete Data

Paper • 2406.04329 • Published Jun 6, 2024 • 8
Simple and Effective Masked Diffusion Language Models

Paper • 2406.07524 • Published Jun 11, 2024 • 12
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 483
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48
Improving Context Fidelity via Native Retrieval-Augmented Reasoning

Paper • 2509.13683 • Published Sep 17 • 8
Multimodal Iterative RAG for Knowledge-Intensive Visual Question Answering

Paper • 2509.00798 • Published Aug 31 • 1

HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video

Paper • 2510.05560 • Published Oct 7 • 7
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7 • 63
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 483
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52

Why mask diffusion does not work

Paper • 2510.03289 • Published Sep 29
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 52

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73
MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 97
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs