Qin Zhou's picture

In a Training Loop 🔄

6 2

Qin Zhou

Matrix53

·

https://matrix53.github.io

AI & ML interests

Computer Vision, Diffusion Model, Video Generation

Recent Activity

authored a paper 4 days ago

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

authored a paper 4 days ago

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

upvoted a collection 4 days ago

View all activity

Organizations

None yet

upvoted a collection 4 days ago

Papers

2 items • Updated 4 days ago • 1

upvoted 2 papers 4 days ago

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

Paper • 2309.02773 • Published Sep 6, 2023 • 1

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

Paper • 2506.09740 • Published Jun 11 • 1

upvoted an article 5 days ago

Article

Diffusers welcomes FLUX-2

+6

18 days ago

•

162

upvoted a paper 11 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 15 days ago • 38

upvoted a paper over 1 year ago

StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

Paper • 2403.09055 • Published Mar 14, 2024 • 27