KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta Paper • 2512.23236 • Published 1 day ago • 2
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 1 day ago • 3 • 1
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 4 days ago • 48
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling Paper • 2512.23162 • Published 2 days ago • 7
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding Paper • 2512.23646 • Published 1 day ago • 8
Video-BrowseComp: Benchmarking Agentic Video Research on Open Web Paper • 2512.23044 • Published 2 days ago • 9
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models Paper • 2512.22238 • Published 7 days ago • 13
An Information Theoretic Perspective on Agentic System Design Paper • 2512.21720 • Published 5 days ago • 6
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published 4 days ago • 25
SVBench: Evaluation of Video Generation Models on Social Reasoning Paper • 2512.21507 • Published 6 days ago • 5