Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published Sep 4 • 28
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Paper • 2506.03147 • Published Jun 3 • 58
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Paper • 2504.10483 • Published Apr 14 • 20
Running on Zero 298 298 Joy Caption Alpha One ⚡ Generate captions for images in various styles and lengths