Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qin Zhou's picture
In a Training Loop 🔄
6 2

Qin Zhou

Matrix53
·
https://matrix53.github.io
  • ACMatrix53
  • Matrix53

AI & ML interests

Computer Vision, Diffusion Model, Video Generation

Recent Activity

authored a paper 4 days ago
ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models
authored a paper 4 days ago
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
upvoted a collection 4 days ago
Papers
View all activity

Organizations

None yet

upvoted a collection 4 days ago

Papers

Collection
2 items • Updated 4 days ago • 1
upvoted 2 papers 4 days ago

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

Paper • 2309.02773 • Published Sep 6, 2023 • 1

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

Paper • 2506.09740 • Published Jun 11 • 1
upvoted an article 5 days ago
view article
Article

Diffusers welcomes FLUX-2

  • +6
18 days ago
•
162
upvoted a paper 11 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 15 days ago • 38
upvoted a paper over 1 year ago

StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

Paper • 2403.09055 • Published Mar 14, 2024 • 27
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs