zzfive's Collections
			 
		
			
				
				
	
	
	
			
			TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
		
			Paper
			
•
			2401.09416
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
		
			Paper
			
•
			2401.10171
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
		
			Paper
			
•
			2311.09217
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			GALA: Generating Animatable Layered Assets from a Single Scan
		
			Paper
			
•
			2401.12979
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields
		
			Paper
			
•
			2401.17895
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			Advances in 3D Generation: A Survey
		
			Paper
			
•
			2401.17807
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			AToM: Amortized Text-to-Mesh using 2D Diffusion
		
			Paper
			
•
			2402.00867
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting
		
			Paper
			
•
			2402.10259
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
		
			Paper
			
•
			2402.12712
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			FlashTex: Fast Relightable Mesh Texturing with LightControlNet
		
			Paper
			
•
			2402.13251
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Consolidating Attention Features for Multi-view Image Editing
		
			Paper
			
•
			2402.14792
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			MVD^2: Efficient Multiview 3D Reconstruction for Multiview Diffusion
		
			Paper
			
•
			2402.14253
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			ViewFusion: Towards Multi-View Consistency via Interpolated Denoising
		
			Paper
			
•
			2402.18842
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			TripoSR: Fast 3D Object Reconstruction from a Single Image
		
			Paper
			
•
			2403.02151
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
		
			Paper
			
•
			2403.01807
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
		
			Paper
			
•
			2403.05034
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			3D-VLA: A 3D Vision-Language-Action Generative World Model
		
			Paper
			
•
			2403.09631
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			GVGEN: Text-to-3D Generation with Volumetric Representation
		
			Paper
			
•
			2403.12957
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
		
			Paper
			
•
			2403.12365
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
		
			Paper
			
•
			2403.12906
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
		
			Paper
			
•
			2403.13524
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
		
			Paper
			
•
			2403.17001
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction
		
			Paper
			
•
			2403.18795
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
		
			Paper
			
•
			2403.19655
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation
		
			Paper
			
•
			2403.19319
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
		
			Paper
			
•
			2404.00987
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			PointInfinity: Resolution-Invariant Point Diffusion Models
		
			Paper
			
•
			2404.03566
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			Robust Gaussian Splatting
		
			Paper
			
•
			2404.04211
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion
		
			Paper
			
•
			2404.06429
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
		
			Paper
			
•
			2404.08252
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting
		
			Paper
			
•
			2404.09458
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			Taming Latent Diffusion Model for Neural Radiance Field Inpainting
		
			Paper
			
•
			2404.09995
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			MeshLRM: Large Reconstruction Model for High-Quality Mesh
		
			Paper
			
•
			2404.12385
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Interactive3D: Create What You Want by Interactive 3D Generation
		
			Paper
			
•
			2404.16510
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			CAT3D: Create Anything in 3D with Multi-View Diffusion Models
		
			Paper
			
•
			2405.10314
			
•
			Published
				
			•
				
				48
			
 
	
	 
	
	
	
			
			Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion
		
			Paper
			
•
			2405.09874
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching
		
			Paper
			
•
			2405.11252
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner
		
			Paper
			
•
			2405.14979
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
		
			Paper
			
•
			2405.15125
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
		
			Paper
			
•
			2405.16822
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Part123: Part-aware 3D Reconstruction from a Single-view Image
		
			Paper
			
•
			2405.16888
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			GFlow: Recovering 4D World from Monocular Video
		
			Paper
			
•
			2405.18426
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
		
			Paper
			
•
			2405.18424
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			NPGA: Neural Parametric Gaussian Avatars
		
			Paper
			
•
			2405.19331
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			GECO: Generative Image-to-3D within a SECOnd
		
			Paper
			
•
			2405.20327
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting
		
			Paper
			
•
			2405.19957
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			4Diffusion: Multi-view Video Diffusion Model for 4D Generation
		
			Paper
			
•
			2405.20674
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
		
			Paper
			
•
			2406.03184
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
		
			Paper
			
•
			2406.07472
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
		
			Paper
			
•
			2406.04338
			
•
			Published
				
			•
				
				39
			
 
	
	 
	
	
	
			
			3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
		
			Paper
			
•
			2406.05132
			
•
			Published
				
			•
				
				30
			
 
	
	 
	
	
	
			
			Real3D: Scaling Up Large Reconstruction Models with Real-World Images
		
			Paper
			
•
			2406.08479
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			LRM-Zero: Training Large Reconstruction Models with Synthesized Data
		
			Paper
			
•
			2406.09371
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors
		
			Paper
			
•
			2406.10111
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
		
			Paper
			
•
			2406.10163
			
•
			Published
				
			•
				
				33
			
 
	
	 
	
	
	
			
			L4GM: Large 4D Gaussian Reconstruction Model
		
			Paper
			
•
			2406.10324
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians
		
			Paper
			
•
			2406.16815
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
		
			Paper
			
•
			2406.16273
			
•
			Published
				
			•
				
				43
			
 
	
	 
	
	
	
			
			GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality
		
			Paper
			
•
			2406.18462
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
		
			Paper
			
•
			2407.06191
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
		
			Paper
			
•
			2407.06938
			
•
			Published
				
			•
				
				25
			
 
	
	 
	
	
	
			
			Controlling Space and Time with Diffusion Models
		
			Paper
			
•
			2407.07860
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			StyleSplat: 3D Object Style Transfer with Gaussian Splatting
		
			Paper
			
•
			2407.09473
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
		
			Paper
			
•
			2402.17214
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
		
			Paper
			
•
			2407.11394
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
		
			Paper
			
•
			2407.11398
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
		
			Paper
			
•
			2407.11793
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections
		
			Paper
			
•
			2407.12306
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Shape of Motion: 4D Reconstruction from a Single Video
		
			Paper
			
•
			2407.13764
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			PlacidDreamer: Advancing Harmony in Text-to-3D Generation
		
			Paper
			
•
			2407.13976
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes
		
			Paper
			
•
			2407.15848
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions
		
			Paper
			
•
			2407.15187
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			Temporal Residual Jacobians For Rig-free Motion Transfer
		
			Paper
			
•
			2407.14958
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
		
			Paper
			
•
			2407.12435
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
		
			Paper
			
•
			2407.17470
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
		
			Paper
			
•
			2407.16988
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Floating No More: Object-Ground Reconstruction from a Single Image
		
			Paper
			
•
			2407.18914
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
		
			Paper
			
•
			2407.19548
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Expressive Whole-Body 3D Gaussian Avatar
		
			Paper
			
•
			2407.21686
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Improving 2D Feature Representations by 3D-Aware Fine-Tuning
		
			Paper
			
•
			2407.20229
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
		
			Paper
			
•
			2404.01300
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
		
			Paper
			
•
			2408.00653
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
			
			TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling
		
			Paper
			
•
			2408.01291
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization
		
			Paper
			
•
			2408.02555
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
			
			An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion
		
			Paper
			
•
			2408.03178
			
•
			Published
				
			•
				
				40
			
 
	
	 
	
	
	
			
			RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis
		
			Paper
			
•
			2408.03356
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields
		
			Paper
			
•
			2408.03822
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches
		
			Paper
			
•
			2408.04567
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework
		
			Paper
			
•
			2408.06190
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors
		
			Paper
			
•
			2408.06019
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields
		
			Paper
			
•
			2408.06697
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			3D Gaussian Editing with A Single Image
		
			Paper
			
•
			2408.07540
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
		
			Paper
			
•
			2408.08000
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model
		
			Paper
			
•
			2408.10198
			
•
			Published
				
			•
				
				35
			
 
	
	 
	
	
	
			
			SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views
		
			Paper
			
•
			2408.10195
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining
		
			Paper
			
•
			2408.10906
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			DreamCinema: Cinematic Transfer with Free Camera and 3D Character
		
			Paper
			
•
			2408.12601
			
•
			Published
				
			•
				
				31
			
 
	
	 
	
	
	
			
			Subsurface Scattering for 3D Gaussian Splatting
		
			Paper
			
•
			2408.12282
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
		
			Paper
			
•
			2408.13252
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			T3M: Text Guided 3D Human Motion Synthesis from Speech
		
			Paper
			
•
			2408.12885
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
		
			Paper
			
•
			2408.12894
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
		
			Paper
			
•
			2408.14211
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Towards Realistic Example-based Modeling via 3D Gaussian Stitching
		
			Paper
			
•
			2408.15708
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
		
			Paper
			
•
			2408.16767
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
			
			SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners
		
			Paper
			
•
			2408.16768
			
•
			Published
				
			•
				
				28
			
 
	
	 
	
	
	
			
			3D Reconstruction with Spatial Memory
		
			Paper
			
•
			2408.16061
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers
		
			Paper
			
•
			2409.04196
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			UniDet3D: Multi-dataset Indoor 3D Object Detection
		
			Paper
			
•
			2409.04234
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
		
			Paper
			
•
			2409.07452
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
		
			Paper
			
•
			2409.08270
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
		
			Paper
			
•
			2409.11406
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction
		
			Paper
			
•
			2409.11211
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Vista3D: Unravel the 3D Darkside of a Single Image
		
			Paper
			
•
			2409.12193
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
		
			Paper
			
•
			2409.12957
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			FlexiTex: Enhancing Texture Generation with Visual Guidance
		
			Paper
			
•
			2409.12431
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt
		
			Paper
			
•
			2409.12892
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Portrait Video Editing Empowered by Multimodal Generative Priors
		
			Paper
			
•
			2409.13591
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
		
			Paper
			
•
			2409.17145
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Game4Loc: A UAV Geo-Localization Benchmark from Game Data
		
			Paper
			
•
			2409.16925
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans
		
			Paper
			
•
			2409.16666
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
		
			Paper
			
•
			2409.18125
			
•
			Published
				
			•
				
				34
			
 
	
	 
	
	
	
			
			Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
		
			Paper
			
•
			2409.17280
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
		
			Paper
			
•
			2410.03825
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models
		
			Paper
			
•
			2409.19989
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
		
			Paper
			
•
			2410.09009
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			GS^3: Efficient Relighting with Triple Gaussian Splatting
		
			Paper
			
•
			2410.11419
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats
		
			Paper
			
•
			2410.12781
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors
		
			Paper
			
•
			2410.16271
			
•
			Published
				
			•
				
				84
			
 
	
	 
	
	
	
			
			SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
		
			Paper
			
•
			2410.17249
			
•
			Published
				
			•
				
				42
			
 
	
	 
	
	
	
			
			3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors
		
			Paper
			
•
			2410.16266
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes
		
			Paper
			
•
			2410.18084
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias
		
			Paper
			
•
			2410.17242
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms
		
			Paper
			
•
			2410.18977
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling
		
			Paper
			
•
			2410.18912
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
		
			Paper
			
•
			2411.02336
			
•
			Published
				
			•
				
				24
			
 
	
	 
	
	
	
			
			GenXD: Generating Any 3D and 4D Scenes
		
			Paper
			
•
			2411.02319
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
		
			Paper
			
•
			2411.02394
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			DreamPolish: Domain Score Distillation With Progressive Geometry Generation
		
			Paper
			
•
			2411.01602
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
		
			Paper
			
•
			2411.03047
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
		
			Paper
			
•
			2411.04928
			
•
			Published
				
			•
				
				57
			
 
	
	 
	
	
	
			
			StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
		
			Paper
			
•
			2411.05738
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			KMM: Key Frame Mask Mamba for Extended Motion Generation
		
			Paper
			
•
			2411.06481
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			SAMPart3D: Segment Any Part in 3D Objects
		
			Paper
			
•
			2411.07184
			
•
			Published
				
			•
				
				28
			
 
	
	 
	
	
	
			
			Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
		
			Paper
			
•
			2411.08017
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
		
			Paper
			
•
			2411.09595
			
•
			Published
				
			•
				
				77
			
 
	
	 
	
	
	
			
			GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation
		
			Paper
			
•
			2411.08033
			
•
			Published
				
			•
				
				25
			
 
	
	 
	
	
	
			
			VeGaS: Video Gaussian Splatting
		
			Paper
			
•
			2411.11024
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation
		
			Paper
			
•
			2411.14384
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Material Anything: Generating Materials for Any 3D Object via Diffusion
		
			Paper
			
•
			2411.15138
			
•
			Published
				
			•
				
				50
			
 
	
	 
	
	
	
			
			SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
		
			Paper
			
•
			2411.16443
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
		
			Paper
			
•
			2411.13550
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			TEXGen: a Generative Diffusion Model for Mesh Textures
		
			Paper
			
•
			2411.14740
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			Learning 3D Representations from Procedural 3D Programs
		
			Paper
			
•
			2411.17467
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
		
			Paper
			
•
			2411.16856
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
		
			Paper
			
•
			2411.18613
			
•
			Published
				
			•
				
				58
			
 
	
	 
	
	
	
			
			MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation
		
			Paper
			
•
			2411.17945
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
		
			Paper
			
•
			2411.18197
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
		
			Paper
			
•
			2411.19527
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
		
			Paper
			
•
			2412.00174
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			World-consistent Video Diffusion with Explicit 3D Modeling
		
			Paper
			
•
			2412.01821
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Imagine360: Immersive 360 Video Generation from Perspective Anchor
		
			Paper
			
•
			2412.03552
			
•
			Published
				
			•
				
				29
			
 
	
	 
	
	
	
			
			Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion
		
			Paper
			
•
			2412.03515
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding
		
			Paper
			
•
			2412.00493
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
		
			Paper
			
•
			2412.03558
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			Structured 3D Latents for Scalable and Versatile 3D Generation
		
			Paper
			
•
			2412.01506
			
•
			Published
				
			•
				
				83
			
 
	
	 
	
	
	
			
			MV-Adapter: Multi-view Consistent Image Generation Made Easy
		
			Paper
			
•
			2412.03632
			
•
			Published
				
			•
				
				24
			
 
	
	 
	
	
	
			
			Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction
		
			Paper
			
•
			2412.04887
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction
		
			Paper
			
•
			2412.03428
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
		
			Paper
			
•
			2412.06699
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views
		
			Paper
			
•
			2412.06767
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Turbo3D: Ultra-fast Text-to-3D Generation
		
			Paper
			
•
			2412.04470
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
		
			Paper
			
•
			2412.09593
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations
		
			Paper
			
•
			2412.05994
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			GenEx: Generating an Explorable World
		
			Paper
			
•
			2412.09624
			
•
			Published
				
			•
				
				97
			
 
	
	 
	
	
	
			
			IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
		
			Paper
			
•
			2412.12083
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs
		
			Paper
			
•
			2412.11258
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation
		
			Paper
			
•
			2412.15200
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Sequence Matters: Harnessing Video Models in 3D Super-Resolution
		
			Paper
			
•
			2412.11525
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
		
			Paper
			
•
			2412.18450
			
•
			Published
				
			•
				
				36
			
 
	
	 
	
	
	
			
			DepthLab: From Partial to Complete
		
			Paper
			
•
			2412.18153
			
•
			Published
				
			•
				
				36
			
 
	
	 
	
	
	
			
			PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models
		
			Paper
			
•
			2412.18608
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models
		
			Paper
			
•
			2412.18605
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
		
			Paper
			
•
			2501.04689
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation
		
			Paper
			
•
			2501.04144
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
		
			Paper
			
•
			2501.07574
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities
		
			Paper
			
•
			2501.08983
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
		
			Paper
			
•
			2501.09433
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor
		
			Paper
			
•
			2501.09978
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D
  Assets Generation
		
			Paper
			
•
			2501.12202
			
•
			Published
				
			•
				
				47
			
 
	
	 
	
	
	
			
			GSTAR: Gaussian Surface Tracking and Reconstruction
		
			Paper
			
•
			2501.10283
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Relightable Full-Body Gaussian Codec Avatars
		
			Paper
			
•
			2501.14726
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Multiview Equivariance Improves 3D Correspondence Understanding with
  Minimal Feature Finetuning
		
			Paper
			
•
			2411.19458
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian
  Splat Generation
		
			Paper
			
•
			2501.16764
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric
  Diffusion
		
			Paper
			
•
			2501.18804
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Fast Encoder-Based 3D from Casual Videos via Point Track Processing
		
			Paper
			
•
			2404.07097
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Text-to-CAD Generation Through Infusing Visual Feedback in Large
  Language Models
		
			Paper
			
•
			2501.19054
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			DreamDPO: Aligning Text-to-3D Generation with Human Preferences via
  Direct Preference Optimization
		
			Paper
			
•
			2502.04370
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			CAD-Editor: A Locate-then-Infill Framework with Automated Training Data
  Synthesis for Text-Based CAD Editing
		
			Paper
			
•
			2502.03997
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Exploring the Potential of Encoder-free Architectures in 3D LMMs
		
			Paper
			
•
			2502.09620
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified
  Flow Models
		
			Paper
			
•
			2502.06608
			
•
			Published
				
			•
				
				40
			
 
	
	 
	
	
	
			
			Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and
  Texture Generation
		
			Paper
			
•
			2502.14247
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via
  Sparse Time-Variant Attribute Modeling
		
			Paper
			
•
			2502.20378
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
		
			Paper
			
•
			2503.01774
			
•
			Published
				
			•
				
				44
			
 
	
	 
	
	
	
			
			Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
		
			Paper
			
•
			2503.01370
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling
		
			Paper
			
•
			2503.09601
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large
  Language Models
		
			Paper
			
•
			2503.10437
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
			
			TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree
  Sequencing
		
			Paper
			
•
			2503.11629
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Unleashing Vecset Diffusion Model for Fast Shape Generation
		
			Paper
			
•
			2503.16302
			
•
			Published
				
			•
				
				43
			
 
	
	 
	
	
	
			
			DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement
  Learning
		
			Paper
			
•
			2503.15265
			
•
			Published
				
			•
				
				46
			
 
	
	 
	
	
	
			
			DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis
		
			Paper
			
•
			2503.15667
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
		
			Paper
			
•
			2503.21732
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal
  Bridging
		
			Paper
			
•
			2503.22236
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Progressive Rendering Distillation: Adapting Stable Diffusion for
  Instant Text-to-Mesh Generation without 3D Data
		
			Paper
			
•
			2503.21694
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			MeshCraft: Exploring Efficient and Controllable Mesh Generation with
  Flow-based DiTs
		
			Paper
			
•
			2503.23022
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			DSO: Aligning 3D Generators with Simulation Feedback for Physical
  Soundness
		
			Paper
			
•
			2503.22677
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in
  One Step
		
			Paper
			
•
			2504.01956
			
•
			Published
				
			•
				
				41
			
 
	
	 
	
	
	
			
			HoloPart: Generative 3D Part Amodal Segmentation
		
			Paper
			
•
			2504.07943
			
•
			Published
				
			•
				
				28
			
 
	
	 
	
	
	
			
			In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
		
			Paper
			
•
			2504.08366
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
		
			Paper
			
•
			2504.05303
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			3D CoCa: Contrastive Learners are 3D Captioners
		
			Paper
			
•
			2504.09518
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			MCP Safety Audit: LLMs with the Model Context Protocol Allow Major
  Security Exploits
		
			Paper
			
•
			2504.03767
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			Diffusion Distillation With Direct Preference Optimization For Efficient
  3D LiDAR Scene Completion
		
			Paper
			
•
			2504.11447
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Vivid4D: Improving 4D Reconstruction from Monocular Video by Video
  Inpainting
		
			Paper
			
•
			2504.11092
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via
  Adaptive Block-Based Gaussian Splatting
		
			Paper
			
•
			2504.09048
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation
		
			Paper
			
•
			2504.13072
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on
  3D Gaussians
		
			Paper
			
•
			2504.15281
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			DiMeR: Disentangled Mesh Reconstruction Model
		
			Paper
			
•
			2504.17670
			
•
			Published
				
			•
				
				24
			
 
	
	 
	
	
	
			
			HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene
  Generation
		
			Paper
			
•
			2504.21650
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			Scenethesis: A Language and Vision Agentic Framework for 3D Scene
  Generation
		
			Paper
			
•
			2505.02836
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with
  Auto-Regressive Transformer
		
			Paper
			
•
			2505.04622
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			3D Scene Generation: A Survey
		
			Paper
			
•
			2505.05474
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
		
			Paper
			
•
			2505.05288
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured
  3D Assets
		
			Paper
			
•
			2505.07747
			
•
			Published
				
			•
				
				61
			
 
	
	 
	
	
	
			
			Constructing a 3D Town from a Single Image
		
			Paper
			
•
			2505.15765
			
•
			Published
				
			•
				
				24
			
 
	
	 
	
	
	
			
			Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse
  Attention
		
			Paper
			
•
			2505.17412
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and
  Styles
		
			Paper
			
•
			2505.21060
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
		
			Paper
			
•
			2505.23253
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian
  Splatting
		
			Paper
			
•
			2505.22854
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and
  Understanding
		
			Paper
			
•
			2506.01853
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
			
			Pro3D-Editor: A Progressive-Views Perspective for Consistent and
  Precise 3D Editing
		
			Paper
			
•
			2506.00512
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			FlexPainter: Flexible and Multi-View Consistent Texture Generation
		
			Paper
			
•
			2506.02620
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
		
			Paper
			
•
			2506.05327
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Aligning Text, Images, and 3D Structure Token-by-Token
		
			Paper
			
•
			2506.08002
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			EmbodiedGen: Towards a Generative 3D World Engine for Embodied
  Intelligence
		
			Paper
			
•
			2506.10600
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated
  Video Streams
		
			Paper
			
•
			2506.08862
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Test3R: Learning to Reconstruct 3D at Test Time
		
			Paper
			
•
			2506.13750
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with
  Hybrid History Condition
		
			Paper
			
•
			2506.17201
			
•
			Published
				
			•
				
				56
			
 
	
	 
	
	
	
			
			DreamCube: 3D Panorama Generation via Multi-plane Synchronization
		
			Paper
			
•
			2506.17206
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate
  Details
		
			Paper
			
•
			2506.16504
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with
  Production-Ready PBR Material
		
			Paper
			
•
			2506.15442
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			3D Arena: An Open Platform for Generative 3D Evaluation
		
			Paper
			
•
			2506.18787
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion
  Models
		
			Paper
			
•
			2506.19851
			
•
			Published
				
			•
				
				60
			
 
	
	 
	
	
	
			
			PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for
  Realistic Articulated Object Modeling
		
			Paper
			
•
			2506.20936
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing
		
			Paper
			
•
			2506.17450
			
•
			Published
				
			•
				
				63
			
 
	
	 
	
	
	
			
			LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with
  TriMap Video Diffusion
		
			Paper
			
•
			2507.02813
			
•
			Published
				
			•
				
				60
			
 
	
	 
	
	
	
			
			SeqTex: Generate Mesh Textures in Video Sequence
		
			Paper
			
•
			2507.04285
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+
  FPS
		
			Paper
			
•
			2507.07136
			
•
			Published
				
			•
				
				38
			
 
	
	 
	
	
	
			
			From One to More: Contextual Part Latents for 3D Generation
		
			Paper
			
•
			2507.08772
			
•
			Published
				
			•
				
				25
			
 
	
	 
	
	
	
			
			Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos
  with Spatio-Temporal Diffusion Models
		
			Paper
			
•
			2507.13344
			
•
			Published
				
			•
				
				56
			
 
	
	 
	
	
	
			
			Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with
  Regularized Score Distillation Sampling
		
			Paper
			
•
			2507.11061
			
•
			Published
				
			•
				
				37
			
 
	
	 
	
	
	
			
			Gaussian Splatting with Discretized SDF for Relightable Assets
		
			Paper
			
•
			2507.15629
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention
		
			Paper
			
•
			2507.17745
			
•
			Published
				
			•
				
				34
			
 
	
	 
	
	
	
			
			Elevating 3D Models: High-Quality Texture and Geometry Refinement from a
  Low-Quality Model
		
			Paper
			
•
			2507.11465
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D
  Worlds from Words or Pixels
		
			Paper
			
•
			2507.21809
			
•
			Published
				
			•
				
				131
			
 
	
	 
	
	
	
			
			BANG: Dividing 3D Assets via Generative Exploded Dynamics
		
			Paper
			
•
			2507.21493
			
•
			Published
				
			•
				
				64
			
 
	
	 
	
	
	
			
			3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
		
			Paper
			
•
			2507.23478
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Dens3R: A Foundation Model for 3D Geometry Prediction
		
			Paper
			
•
			2507.16290
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Gaussian Variation Field Diffusion for High-fidelity Video-to-4D
  Synthesis
		
			Paper
			
•
			2507.23785
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior
		
			Paper
			
•
			2508.00599
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D
  Generation
		
			Paper
			
•
			2508.00428
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			MeshLLM: Empowering Large Language Models to Progressively Understand
  and Generate 3D Mesh
		
			Paper
			
•
			2508.01242
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Matrix-3D: Omnidirectional Explorable 3D World Generation
		
			Paper
			
•
			2508.08086
			
•
			Published
				
			•
				
				75
			
 
	
	 
	
	
	
			
			VertexRegen: Mesh Generation with Continuous Level of Detail
		
			Paper
			
•
			2508.09062
			
•
			Published
				
			•
				
				37
			
 
	
	 
	
	
	
			
			StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image
  Translation
		
			Paper
			
•
			2508.11203
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			TexVerse: A Universe of 3D Objects with High-Resolution Textures
		
			Paper
			
•
			2508.10868
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			4DNeX: Feed-Forward 4D Generative Modeling Made Easy
		
			Paper
			
•
			2508.13154
			
•
			Published
				
			•
				
				62
			
 
	
	 
	
	
	
			
			SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
		
			Paper
			
•
			2508.15769
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			MV-RAG: Retrieval Augmented Multiview Diffusion
		
			Paper
			
•
			2508.16577
			
•
			Published
				
			•
				
				38
			
 
	
	 
	
	
	
			
			VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D
  Space
		
			Paper
			
•
			2508.19247
			
•
			Published
				
			•
				
				41
			
 
	
	 
	
	
	
			
			Pixie: Fast and Generalizable Supervised Learning of 3D Physics from
  Pixels
		
			Paper
			
•
			2508.17437
			
•
			Published
				
			•
				
				36
			
 
	
	 
	
	
	
			
			FastMesh: Efficient Artistic Mesh Generation via Component Decoupling
		
			Paper
			
•
			2508.19188
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion
  Models
		
			Paper
			
•
			2508.18271
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Collaborative Multi-Modal Coding for High-Quality 3D Generation
		
			Paper
			
•
			2508.15228
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			P3-SAM: Native 3D Part Segmentation
		
			Paper
			
•
			2509.06784
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			X-Part: high fidelity and structure coherent shape decomposition
		
			Paper
			
•
			2509.08643
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			3D Aware Region Prompted Vision Language Model
		
			Paper
			
•
			2509.13317
			
•
			Published
				
			•
				
				14
			
 
	
	 
	
	
	
			
			SPATIALGEN: Layout-guided 3D Indoor Scene Generation
		
			Paper
			
•
			2509.14981
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model
  Self-Distillation
		
			Paper
			
•
			2509.19296
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
			
			GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface
  Reconstruction
		
			Paper
			
•
			2509.18090
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks
		
			Paper
			
•
			2510.15019
			
•
			Published
				
			•
				
				62
			
 
	
	 
	
	
	
			
			FlashWorld: High-quality 3D Scene Generation within Seconds
		
			Paper
			
•
			2510.13678
			
•
			Published
				
			•
				
				70