Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter Paper • 2309.02773 • Published Sep 6, 2023 • 1
ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models Paper • 2506.09740 • Published Jun 11 • 1
Geometrically-Constrained Agent for Spatial Reasoning Paper • 2511.22659 • Published 15 days ago • 38
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control Paper • 2403.09055 • Published Mar 14, 2024 • 27