PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity Paper • 2510.23603 • Published 11 days ago • 21
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published 25 days ago • 97
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting Paper • 2510.10637 • Published 26 days ago • 12
TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios Paper • 2505.12891 • Published May 19 • 10
Residual Off-Policy RL for Finetuning Behavior Cloning Policies Paper • 2509.19301 • Published Sep 23 • 18
MMR1 Collection Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources • 1 item • Updated Sep 26 • 1
MMR1 Collection Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources • 1 item • Updated Sep 26 • 1
MMR1 Collection Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources • 1 item • Updated Sep 26 • 1