FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 3 days ago • 16
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 3 days ago • 19
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 4 days ago • 20
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published 4 days ago • 24
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning Paper • 2601.09708 • Published 4 days ago • 47
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published 6 days ago • 43
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published 10 days ago • 44
Orient Anything V2: Unifying Orientation and Rotation Understanding Paper • 2601.05573 • Published 9 days ago • 8
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 10 days ago • 44
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 10 days ago • 193
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 14 days ago • 41
Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction Paper • 2601.04090 • Published 11 days ago • 1
RGS-SLAM: Robust Gaussian Splatting SLAM with One-Shot Dense Initialization Paper • 2601.00705 • Published 21 days ago • 3
Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models Paper • 2601.01321 • Published 15 days ago • 17
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 12 days ago • 26
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 12 days ago • 95