-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 10 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 77
Collections
Discover the best community collections!
Collections including paper arxiv:2411.15466
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37 -
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
Paper • 2411.17223 • Published • 7 -
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
Paper • 2411.17176 • Published • 24
-
Live Portrait
🤪3.64kApply the motion of a video on a portrait
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Material Anything: Generating Materials for Any 3D Object via Diffusion
Paper • 2411.15138 • Published • 50
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper • 2405.08748 • Published • 24 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper • 2405.10300 • Published • 30 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 131 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper • 2405.11143 • Published • 41
-
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
Paper • 2508.18966 • Published • 56 -
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Paper • 2509.08721 • Published • 673 -
FastVLM: Efficient Vision Encoding for Vision Language Models
Paper • 2412.13303 • Published • 71 -
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39
-
Zero-shot Image Editing with Reference Imitation
Paper • 2406.07547 • Published • 33 -
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
Paper • 2406.10601 • Published • 70 -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper • 2407.05282 • Published • 16 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 42
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 10 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 77
-
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
Paper • 2508.18966 • Published • 56 -
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Paper • 2509.08721 • Published • 673 -
FastVLM: Efficient Vision Encoding for Vision Language Models
Paper • 2412.13303 • Published • 71 -
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37 -
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
Paper • 2411.17223 • Published • 7 -
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
Paper • 2411.17176 • Published • 24
-
Live Portrait
🤪3.64kApply the motion of a video on a portrait
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Material Anything: Generating Materials for Any 3D Object via Diffusion
Paper • 2411.15138 • Published • 50
-
Zero-shot Image Editing with Reference Imitation
Paper • 2406.07547 • Published • 33 -
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
Paper • 2406.10601 • Published • 70 -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper • 2407.05282 • Published • 16 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 42
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper • 2405.08748 • Published • 24 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper • 2405.10300 • Published • 30 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 131 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper • 2405.11143 • Published • 41