A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 12
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation Paper • 2503.24026 • Published Mar 31
RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer Paper • 2505.23171 • Published May 29 • 3
WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration Paper • 2506.20590 • Published Jun 25
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models Paper • 2510.01623 • Published Oct 2 • 9
Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization Paper • 2509.10140 • Published Sep 12 • 2
DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion Paper • 2510.15264 • Published 29 days ago • 1
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation Paper • 2412.11183 • Published Dec 15, 2024
GigaBrain-0: A World Model-Powered Vision-Language-Action Model Paper • 2510.19430 • Published 23 days ago • 44
HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration Paper • 2504.03536 • Published Apr 4 • 13