3d - a zzfive Collection

TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion

Paper • 2401.09416 • Published Jan 17, 2024 • 11

SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild

Paper • 2401.10171 • Published Jan 18, 2024 • 14

DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model

Paper • 2311.09217 • Published Nov 15, 2023 • 22

GALA: Generating Animatable Layered Assets from a Single Scan

Paper • 2401.12979 • Published Jan 23, 2024 • 9

ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields

Paper • 2401.17895 • Published Jan 31, 2024 • 16

Advances in 3D Generation: A Survey

Paper • 2401.17807 • Published Jan 31, 2024 • 19

AToM: Amortized Text-to-Mesh using 2D Diffusion

Paper • 2402.00867 • Published Feb 1, 2024 • 11

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Paper • 2402.10259 • Published Feb 15, 2024 • 16

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

Paper • 2402.12712 • Published Feb 20, 2024 • 18

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28, 2024 • 19

Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation

Paper • 2403.19319 • Published Mar 28, 2024 • 14

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

Paper • 2404.00987 • Published Apr 1, 2024 • 23

PointInfinity: Resolution-Invariant Point Diffusion Models

Paper • 2404.03566 • Published Apr 4, 2024 • 16

Robust Gaussian Splatting

Paper • 2404.04211 • Published Apr 5, 2024 • 10

Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion

Paper • 2404.06429 • Published Apr 9, 2024 • 7

MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

Paper • 2404.08252 • Published Apr 12, 2024 • 6

CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting

Paper • 2404.09458 • Published Apr 15, 2024 • 7

Taming Latent Diffusion Model for Neural Radiance Field Inpainting

Paper • 2404.09995 • Published Apr 15, 2024 • 7

MeshLRM: Large Reconstruction Model for High-Quality Mesh

Paper • 2404.12385 • Published Apr 18, 2024 • 27

Interactive3D: Create What You Want by Interactive 3D Generation

Paper • 2404.16510 • Published Apr 25, 2024 • 21

CAT3D: Create Anything in 3D with Multi-View Diffusion Models

Paper • 2405.10314 • Published May 16, 2024 • 48

Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion

Paper • 2405.09874 • Published May 16, 2024 • 20

Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching

Paper • 2405.11252 • Published May 18, 2024 • 16

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

Paper • 2405.14979 • Published May 23, 2024 • 19

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

Paper • 2405.15125 • Published May 24, 2024 • 8

Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels

Paper • 2405.16822 • Published May 27, 2024 • 12

Part123: Part-aware 3D Reconstruction from a Single-view Image

Paper • 2405.16888 • Published May 27, 2024 • 12

GFlow: Recovering 4D World from Monocular Video

Paper • 2405.18426 • Published May 28, 2024 • 17

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Paper • 2405.18424 • Published May 28, 2024 • 9

NPGA: Neural Parametric Gaussian Avatars

Paper • 2405.19331 • Published May 29, 2024 • 10

GECO: Generative Image-to-3D within a SECOnd

Paper • 2405.20327 • Published May 30, 2024 • 11

PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting

Paper • 2405.19957 • Published May 30, 2024 • 10

4Diffusion: Multi-view Video Diffusion Model for 4D Generation

Paper • 2405.20674 • Published May 31, 2024 • 15

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Paper • 2406.03184 • Published Jun 5, 2024 • 22

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Paper • 2406.07472 • Published Jun 11, 2024 • 13

Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Paper • 2406.04338 • Published Jun 6, 2024 • 39

3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination

Paper • 2406.05132 • Published Jun 7, 2024 • 30

Real3D: Scaling Up Large Reconstruction Models with Real-World Images

Paper • 2406.08479 • Published Jun 12, 2024 • 7

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

Paper • 2406.09371 • Published Jun 13, 2024 • 5

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors

Paper • 2406.10111 • Published Jun 14, 2024 • 6

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers

Paper • 2406.10163 • Published Jun 14, 2024 • 33

L4GM: Large 4D Gaussian Reconstruction Model

Paper • 2406.10324 • Published Jun 14, 2024 • 13

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

Paper • 2406.16815 • Published Jun 24, 2024 • 7

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

Paper • 2406.16273 • Published Jun 24, 2024 • 43

GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

Paper • 2406.18462 • Published Jun 26, 2024 • 12

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

Paper • 2407.06191 • Published Jul 8, 2024 • 14

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Paper • 2407.06938 • Published Jul 9, 2024 • 25

Controlling Space and Time with Diffusion Models

Paper • 2407.07860 • Published Jul 10, 2024 • 17

StyleSplat: 3D Object Style Transfer with Gaussian Splatting

Paper • 2407.09473 • Published Jul 12, 2024 • 13

CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

Paper • 2402.17214 • Published Feb 27, 2024 • 2

DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Paper • 2407.11394 • Published Jul 16, 2024 • 12

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Paper • 2407.11398 • Published Jul 16, 2024 • 10

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Paper • 2407.11793 • Published Jul 16, 2024 • 3

Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections

Paper • 2407.12306 • Published Jul 17, 2024 • 6

Shape of Motion: 4D Reconstruction from a Single Video

Paper • 2407.13764 • Published Jul 18, 2024 • 20

PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Paper • 2407.13976 • Published Jul 19, 2024 • 5

BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes

Paper • 2407.15848 • Published Jul 22, 2024 • 17

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

Paper • 2407.15187 • Published Jul 21, 2024 • 13

Temporal Residual Jacobians For Rig-free Motion Transfer

Paper • 2407.14958 • Published Jul 20, 2024 • 5

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

Paper • 2407.12435 • Published Jul 17, 2024 • 14

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

Paper • 2407.17470 • Published Jul 24, 2024 • 16

DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction

Paper • 2407.16988 • Published Jul 24, 2024 • 9

Floating No More: Object-Ground Reconstruction from a Single Image

Paper • 2407.18914 • Published Jul 26, 2024 • 20

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28, 2024 • 27

Expressive Whole-Body 3D Gaussian Avatar

Paper • 2407.21686 • Published Jul 31, 2024 • 8

Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Paper • 2407.20229 • Published Jul 29, 2024 • 7

NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

Paper • 2404.01300 • Published Apr 1, 2024 • 4

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Paper • 2408.00653 • Published Aug 1, 2024 • 32

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

Paper • 2408.01291 • Published Aug 2, 2024 • 13

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 32

An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion

Paper • 2408.03178 • Published Aug 6, 2024 • 40

RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis

Paper • 2408.03356 • Published Aug 6, 2024 • 10

Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields

Paper • 2408.03822 • Published Aug 7, 2024 • 14

Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

Paper • 2408.04567 • Published Aug 8, 2024 • 26

FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework

Paper • 2408.06190 • Published Aug 12, 2024 • 18

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12, 2024 • 15

SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

Paper • 2408.06697 • Published Aug 13, 2024 • 15

3D Gaussian Editing with A Single Image

Paper • 2408.07540 • Published Aug 14, 2024 • 12

MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing

Paper • 2408.08000 • Published Aug 15, 2024 • 9

MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model

Paper • 2408.10198 • Published Aug 19, 2024 • 35

SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views

Paper • 2408.10195 • Published Aug 19, 2024 • 13

ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining

Paper • 2408.10906 • Published Aug 20, 2024 • 3

DreamCinema: Cinematic Transfer with Free Camera and 3D Character

Paper • 2408.12601 • Published Aug 22, 2024 • 31

Subsurface Scattering for 3D Gaussian Splatting

Paper • 2408.12282 • Published Aug 22, 2024 • 7

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Paper • 2408.13252 • Published Aug 23, 2024 • 26

T3M: Text Guided 3D Human Motion Synthesis from Speech

Paper • 2408.12885 • Published Aug 23, 2024 • 13

FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering

Paper • 2408.12894 • Published Aug 23, 2024 • 6

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Paper • 2408.14211 • Published Aug 26, 2024 • 11

Towards Realistic Example-based Modeling via 3D Gaussian Stitching

Paper • 2408.15708 • Published Aug 28, 2024 • 8

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

Paper • 2408.16767 • Published Aug 29, 2024 • 32

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29, 2024 • 28

3D Reconstruction with Spatial Memory

Paper • 2408.16061 • Published Aug 28, 2024 • 15

GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers

Paper • 2409.04196 • Published Sep 6, 2024 • 16

UniDet3D: Multi-dataset Indoor 3D Object Detection

Paper • 2409.04234 • Published Sep 6, 2024 • 9

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models

Paper • 2409.07452 • Published Sep 11, 2024 • 21

FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally

Paper • 2409.08270 • Published Sep 12, 2024 • 12

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

Paper • 2409.11211 • Published Sep 17, 2024 • 9

Vista3D: Unravel the 3D Darkside of a Single Image

Paper • 2409.12193 • Published Sep 18, 2024 • 10

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Paper • 2409.12957 • Published Sep 19, 2024 • 21

FlexiTex: Enhancing Texture Generation with Visual Guidance

Paper • 2409.12431 • Published Sep 19, 2024 • 13

3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt

Paper • 2409.12892 • Published Sep 19, 2024 • 5

Portrait Video Editing Empowered by Multimodal Generative Priors

Paper • 2409.13591 • Published Sep 20, 2024 • 17

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Paper • 2409.17145 • Published Sep 25, 2024 • 15

Game4Loc: A UAV Geo-Localization Benchmark from Game Data

Paper • 2409.16925 • Published Sep 25, 2024 • 8

TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans

Paper • 2409.16666 • Published Sep 25, 2024 • 7

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26, 2024 • 34

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Paper • 2409.17280 • Published Sep 25, 2024 • 11

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

Paper • 2410.03825 • Published Oct 4, 2024 • 19

RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models

Paper • 2409.19989 • Published Sep 30, 2024 • 18

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Paper • 2410.09009 • Published Oct 11, 2024 • 15

GS^3: Efficient Relighting with Triple Gaussian Splatting

Paper • 2410.11419 • Published Oct 15, 2024 • 12

Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats

Paper • 2410.12781 • Published Oct 16, 2024 • 6

FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors

Paper • 2410.16271 • Published Oct 21, 2024 • 84

SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes

Paper • 2410.17249 • Published Oct 22, 2024 • 42

3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors

Paper • 2410.16266 • Published Oct 21, 2024 • 5

DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes

Paper • 2410.18084 • Published Oct 23, 2024 • 14

LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias

Paper • 2410.17242 • Published Oct 22, 2024 • 5

MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms

Paper • 2410.18977 • Published Oct 24, 2024 • 15

Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling

Paper • 2410.18912 • Published Oct 24, 2024 • 6

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published Nov 4, 2024 • 24

GenXD: Generating Any 3D and 4D Scenes

Paper • 2411.02319 • Published Nov 4, 2024 • 20

AutoVFX: Physically Realistic Video Editing from Natural Language Instructions

Paper • 2411.02394 • Published Nov 4, 2024 • 17

DreamPolish: Domain Score Distillation With Progressive Geometry Generation

Paper • 2411.01602 • Published Nov 3, 2024 • 11

GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details

Paper • 2411.03047 • Published Nov 5, 2024 • 9

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published Nov 7, 2024 • 57

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15

KMM: Key Frame Mask Mamba for Extended Motion Generation

Paper • 2411.06481 • Published Nov 10, 2024 • 5

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published Nov 11, 2024 • 28

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Paper • 2411.08017 • Published Nov 12, 2024 • 11

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 77

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Paper • 2411.08033 • Published Nov 12, 2024 • 25

VeGaS: Video Gaussian Splatting

Paper • 2411.11024 • Published Nov 17, 2024 • 7

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation

Paper • 2411.14384 • Published Nov 21, 2024 • 9

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 50

SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis

Paper • 2411.16443 • Published Nov 25, 2024 • 12

Find Any Part in 3D

Paper • 2411.13550 • Published Nov 20, 2024 • 7

TEXGen: a Generative Diffusion Model for Mesh Textures

Paper • 2411.14740 • Published Nov 22, 2024 • 18

Learning 3D Representations from Procedural 3D Programs

Paper • 2411.17467 • Published Nov 25, 2024 • 9

SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE

Paper • 2411.16856 • Published Nov 25, 2024 • 13

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Paper • 2411.18613 • Published Nov 27, 2024 • 58

MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation

Paper • 2411.17945 • Published Nov 26, 2024 • 27

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

Paper • 2411.18197 • Published Nov 27, 2024 • 14

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Paper • 2411.19527 • Published Nov 29, 2024 • 11

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

World-consistent Video Diffusion with Explicit 3D Modeling

Paper • 2412.01821 • Published Dec 2, 2024 • 4

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Paper • 2412.03552 • Published Dec 4, 2024 • 29

Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

Paper • 2412.03515 • Published Dec 4, 2024 • 27

Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding

Paper • 2412.00493 • Published Nov 30, 2024 • 17

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Paper • 2412.03558 • Published Dec 4, 2024 • 20

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 83

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Paper • 2412.03632 • Published Dec 4, 2024 • 24

Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

Paper • 2412.04887 • Published Dec 6, 2024 • 18

2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction

Paper • 2412.03428 • Published Dec 4, 2024 • 11

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Paper • 2412.06699 • Published Dec 9, 2024 • 13

MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views

Paper • 2412.06767 • Published Dec 9, 2024 • 8

Turbo3D: Ultra-fast Text-to-3D Generation

Paper • 2412.04470 • Published Dec 5, 2024 • 4

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published Dec 12, 2024 • 18

PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations

Paper • 2412.05994 • Published Dec 8, 2024 • 19

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Paper • 2412.12083 • Published Dec 16, 2024 • 12

GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs

Paper • 2412.11258 • Published Dec 15, 2024 • 13

DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

Paper • 2412.15200 • Published Dec 19, 2024 • 9

Sequence Matters: Harnessing Video Models in 3D Super-Resolution

Paper • 2412.11525 • Published Dec 16, 2024 • 11

3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding

Paper • 2412.18450 • Published Dec 24, 2024 • 36

DepthLab: From Partial to Complete

Paper • 2412.18153 • Published Dec 24, 2024 • 36

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

Paper • 2412.18608 • Published Dec 24, 2024 • 18

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Paper • 2412.18605 • Published Dec 24, 2024 • 22

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Paper • 2501.04689 • Published Jan 8 • 17

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

UnCommon Objects in 3D

Paper • 2501.07574 • Published Jan 13 • 13

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 21

CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation

Paper • 2501.09433 • Published Jan 16 • 18

GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor

Paper • 2501.09978 • Published Jan 17 • 6

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 47

GSTAR: Gaussian Surface Tracking and Reconstruction

Paper • 2501.10283 • Published Jan 17 • 5

Relightable Full-Body Gaussian Codec Avatars

Paper • 2501.14726 • Published Jan 24 • 10

Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

Paper • 2411.19458 • Published Nov 29, 2024 • 6

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Paper • 2501.18804 • Published Jan 30 • 5

Fast Encoder-Based 3D from Casual Videos via Point Track Processing

Paper • 2404.07097 • Published Apr 10, 2024 • 4

Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models

Paper • 2501.19054 • Published Jan 31 • 10

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Paper • 2502.04370 • Published Feb 5 • 7

CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing

Paper • 2502.03997 • Published Feb 6 • 9

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Paper • 2502.09620 • Published Feb 13 • 26

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Paper • 2502.06608 • Published Feb 10 • 40

Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation

Paper • 2502.14247 • Published Feb 20 • 6

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

Paper • 2502.20378 • Published Feb 27 • 5

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published Mar 3 • 44

Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation

Paper • 2503.01370 • Published Mar 3 • 15

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

Paper • 2503.09601 • Published Mar 12 • 16

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Paper • 2503.10437 • Published Mar 13 • 32

TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing

Paper • 2503.11629 • Published Mar 14 • 6

Unleashing Vecset Diffusion Model for Fast Shape Generation

Paper • 2503.16302 • Published Mar 20 • 43

DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19 • 46

DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Paper • 2503.15667 • Published Mar 19 • 8

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Paper • 2503.21732 • Published Mar 27 • 9

Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging

Paper • 2503.22236 • Published Mar 28 • 11

Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data

Paper • 2503.21694 • Published Mar 27 • 15

MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs

Paper • 2503.23022 • Published Mar 29 • 6

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

Paper • 2503.22677 • Published Mar 28 • 5

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Paper • 2504.01956 • Published Apr 2 • 41

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published Apr 10 • 28

In-2-4D: Inbetweening from Two Single-View Images to 4D Generation

Paper • 2504.08366 • Published Apr 11 • 10

InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Paper • 2504.05303 • Published Apr 7 • 5

3D CoCa: Contrastive Learners are 3D Captioners

Paper • 2504.09518 • Published Apr 13 • 5

MCP Safety Audit: LLMs with the Model Context Protocol Allow Major Security Exploits

Paper • 2504.03767 • Published Apr 2 • 3

Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

Paper • 2504.11447 • Published Apr 15 • 4

Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting

Paper • 2504.11092 • Published Apr 15 • 9

BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting

Paper • 2504.09048 • Published Apr 12 • 7

HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation

Paper • 2504.13072 • Published Apr 17 • 13

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

Paper • 2504.15281 • Published Apr 21 • 23

DiMeR: Disentangled Mesh Reconstruction Model

Paper • 2504.17670 • Published Apr 24 • 24

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

Paper • 2504.21650 • Published Apr 30 • 16

Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation

Paper • 2505.02836 • Published May 5 • 8

PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Paper • 2505.04622 • Published May 7 • 27

3D Scene Generation: A Survey

Paper • 2505.05474 • Published May 8 • 21

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

Paper • 2505.05288 • Published May 8 • 14

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Paper • 2505.07747 • Published May 12 • 61

Constructing a 3D Town from a Single Image

Paper • 2505.15765 • Published May 21 • 24

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Paper • 2505.17412 • Published May 23 • 21

Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles

Paper • 2505.21060 • Published May 27 • 4

UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes

Paper • 2505.23253 • Published May 29 • 4

CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting

Paper • 2505.22854 • Published May 28 • 4

ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

Paper • 2506.01853 • Published Jun 2 • 32

Pro3D-Editor : A Progressive-Views Perspective for Consistent and Precise 3D Editing

Paper • 2506.00512 • Published May 31 • 5

FlexPainter: Flexible and Multi-View Consistent Texture Generation

Paper • 2506.02620 • Published Jun 3 • 14

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting

Paper • 2506.05327 • Published Jun 5 • 11

Aligning Text, Images, and 3D Structure Token-by-Token

Paper • 2506.08002 • Published Jun 9 • 21

EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

Paper • 2506.10600 • Published Jun 12 • 8

StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Paper • 2506.08862 • Published Jun 10 • 5

Test3R: Learning to Reconstruct 3D at Test Time

Paper • 2506.13750 • Published Jun 16 • 27

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Paper • 2506.17201 • Published Jun 20 • 56

DreamCube: 3D Panorama Generation via Multi-plane Synchronization

Paper • 2506.17206 • Published Jun 20 • 23

Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details

Paper • 2506.16504 • Published Jun 19 • 26

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Paper • 2506.15442 • Published Jun 18 • 12

3D Arena: An Open Platform for Generative 3D Evaluation

Paper • 2506.18787 • Published Jun 23 • 13

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 60

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

Paper • 2506.20936 • Published Jun 26 • 12

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published Jun 20 • 63

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Paper • 2507.02813 • Published Jul 3 • 60

SeqTex: Generate Mesh Textures in Video Sequence

Paper • 2507.04285 • Published Jul 6 • 9

LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Paper • 2507.07136 • Published Jul 9 • 38

From One to More: Contextual Part Latents for 3D Generation

Paper • 2507.08772 • Published Jul 11 • 25

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Paper • 2507.13344 • Published Jul 17 • 56

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Paper • 2507.11061 • Published Jul 15 • 37

Gaussian Splatting with Discretized SDF for Relightable Assets

Paper • 2507.15629 • Published Jul 21 • 23

Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Paper • 2507.17745 • Published Jul 23 • 34

Elevating 3D Models: High-Quality Texture and Geometry Refinement from a Low-Quality Model

Paper • 2507.11465 • Published Jul 15 • 17

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 131

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Paper • 2507.23478 • Published Jul 31 • 15

Dens3R: A Foundation Model for 3D Geometry Prediction

Paper • 2507.16290 • Published Jul 22 • 8

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published Jul 31 • 18

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Paper • 2508.00599 • Published Aug 1 • 7

Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation

Paper • 2508.00428 • Published Aug 1 • 3

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

Paper • 2508.01242 • Published Aug 2 • 10

Matrix-3D: Omnidirectional Explorable 3D World Generation

Paper • 2508.08086 • Published Aug 11 • 75

VertexRegen: Mesh Generation with Continuous Level of Detail

Paper • 2508.09062 • Published Aug 12 • 37

StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation

Paper • 2508.11203 • Published Aug 15 • 10

TexVerse: A Universe of 3D Objects with High-Resolution Textures

Paper • 2508.10868 • Published Aug 14 • 17

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published Aug 18 • 62

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass

Paper • 2508.15769 • Published Aug 21 • 19

MV-RAG: Retrieval Augmented Multiview Diffusion

Paper • 2508.16577 • Published Aug 22 • 38

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26 • 41

Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels

Paper • 2508.17437 • Published Aug 20 • 36

FastMesh:Efficient Artistic Mesh Generation via Component Decoupling

Paper • 2508.19188 • Published Aug 26 • 16

ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

Paper • 2508.18271 • Published Aug 25 • 8

Collaborative Multi-Modal Coding for High-Quality 3D Generation

Paper • 2508.15228 • Published Aug 21 • 4

P3-SAM: Native 3D Part Segmentation

Paper • 2509.06784 • Published Sep 8 • 23

X-Part: high fidelity and structure coherent shape decomposition

Paper • 2509.08643 • Published Sep 10 • 26

3D Aware Region Prompted Vision Language Model

Paper • 2509.13317 • Published Sep 16 • 14

SPATIALGEN: Layout-guided 3D Indoor Scene Generation

Paper • 2509.14981 • Published Sep 18 • 27

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Paper • 2509.19296 • Published Sep 23 • 22

GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction

Paper • 2509.18090 • Published Sep 22 • 3

NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

Paper • 2510.15019 • Published 19 days ago • 62

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published 20 days ago • 70