Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors Paper • 2505.24625 • Published May 30 • 9
Towards Learning a Generalist Model for Embodied Navigation Paper • 2312.02010 • Published Dec 4, 2023
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding Paper • 2412.00493 • Published Nov 30, 2024 • 17