World Simulation with Video Foundation Models for Physical AI Paper • 2511.00062 • Published 14 days ago • 39
view post Post 2481 🎉 NEW RELEASES: Cosmos Predict 2.5 and Transfer 2.5Cosmos Predict 2.5:- Combines Text2World, Image2World, and Video2World- Multimodal, future-state video predictionCosmos Transfer 2.5:- High-fidelity multicontrol world simulations- Inputs: RGB, depth, segmentation—blended seamlesslyThese updates boost development of autonomous vehicles, robotics, and video analytics.Don’t miss Jensen Huang’s keynote at NVIDIA GTC Washington, D.C. on 10/28 to hear the latest in physical AI. 📺 Watch live: https://nvda.ws/4pUjF4x 🔗 Try Predict 2.5: https://nvda.ws/4otReZZ 🔗 Try Transfer 2.5: https://nvda.ws/46GEx7T See translation 🔥 4 4 + Reply
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture Paper • 2507.05163 • Published Jul 7 • 41
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published Mar 18 • 20
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets Mar 18 • 42
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published Jan 7 • 81
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Paper • 2412.03517 • Published Dec 4, 2024 • 19
Cached Transformers: Improving Transformers with Differentiable Memory Cache Paper • 2312.12742 • Published Dec 20, 2023 • 14
Cached Transformers: Improving Transformers with Differentiable Memory Cache Paper • 2312.12742 • Published Dec 20, 2023 • 14