RLCR Collection Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6 • 5
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 92
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8 • 180
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D Paper • 2411.02336 • Published Nov 4, 2024 • 24
Platypus: A Generalized Specialist Model for Reading Text in Various Forms Paper • 2408.14805 • Published Aug 27, 2024 • 16
CommonCatalog Collection Common Catalog, a dataset with Creative Commons licensed images and machine-generated caption pairs • 8 items • Updated May 16, 2024 • 16
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Paper • 2406.09162 • Published Jun 13, 2024 • 14
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Paper • 2403.05135 • Published Mar 8, 2024 • 45
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation Paper • 2306.17115 • Published Jun 29, 2023 • 11