- Efficient-Large-Model/Fast_dLLM_v2_1.5B
  2B • Updated • 7.69k • 5
- Efficient-Large-Model/Fast_dLLM_v2_7B
  333k • Updated • 7.64k • 12
- Fast-dLLM v2: Efficient Block-Diffusion LLM
  Paper • 2509.26328 • Published • 52
- Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
  Paper • 2505.22618 • Published • 44
Collections
Collections including paper arxiv:2509.26328
- Structured Denoising Diffusion Models in Discrete State-Spaces
  Paper • 2107.03006 • Published • 1
- Simplified and Generalized Masked Diffusion for Discrete Data
  Paper • 2406.04329 • Published • 8
- Simple and Effective Masked Diffusion Language Models
  Paper • 2406.07524 • Published • 12
- Large Language Diffusion Models
  Paper • 2502.09992 • Published • 122

- HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
  Paper • 2510.05560 • Published • 7
- TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
  Paper • 2510.06217 • Published • 63
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 483
- Fast-dLLM v2: Efficient Block-Diffusion LLM
  Paper • 2509.26328 • Published • 52

- Large Language Diffusion Models
  Paper • 2502.09992 • Published • 122
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
  Paper • 2503.09573 • Published • 73
- MMaDA: Multimodal Large Diffusion Language Models
  Paper • 2505.15809 • Published • 97
- Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
  Paper • 2505.15045 • Published • 54

- Fast-dLLM v2: Efficient Block-Diffusion LLM
  Paper • 2509.26328 • Published • 52
- Attention Is All You Need for KV Cache in Diffusion LLMs
  Paper • 2510.14973 • Published • 38
- Attention Sinks in Diffusion Language Models
  Paper • 2510.15731 • Published • 48
- Diffusion Language Models are Super Data Learners
  Paper • 2511.03276 • Published • 116

- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 483
- When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
  Paper • 2510.07499 • Published • 48
- Improving Context Fidelity via Native Retrieval-Augmented Reasoning
  Paper • 2509.13683 • Published • 8
- Multimodal Iterative RAG for Knowledge-Intensive Visual Question Answering
  Paper • 2509.00798 • Published • 1