Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published Oct 2025 • 117
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 2025 • 27
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing Paper • 2508.09192 • Published Aug 8, 2025 • 30
Scaling Speculative Decoding with Lookahead Reasoning Paper • 2506.19830 • Published Jun 24, 2025 • 12
Faster Video Diffusion with Trainable Sparse Attention Paper • 2505.13389 • Published May 19, 2025 • 37
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11, 2025 • 130
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14, 2025 • 55
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile Paper • 2502.06155 • Published Feb 10, 2025 • 10
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers Paper • 2310.03294 • Published Oct 5, 2023 • 2
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 37
Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks Paper • 2306.13103 • Published Jun 16, 2023 • 2
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving Paper • 2401.09670 • Published Jan 18, 2024 • 2
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding Paper • 2402.02057 • Published Feb 3, 2024