SEPT: Towards Efficient Scene Representation Learning for Motion Prediction Paper • 2309.15289 • Published Sep 26, 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Paper • 2310.02054 • Published Oct 3, 2023 • 1
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models Paper • 2310.08582 • Published Oct 12, 2023 • 3
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought Paper • 2305.15021 • Published May 24, 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL Paper • 2305.19923 • Published May 31, 2023
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners Paper • 2302.01877 • Published Feb 3, 2023
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model Paper • 2408.09559 • Published Aug 18, 2024
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) Paper • 2409.02920 • Published Sep 4, 2024
Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking Paper • 2409.16287 • Published Sep 24, 2024
AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems Paper • 2503.06669 • Published Mar 9 • 2
G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation Paper • 2411.18369 • Published Nov 27, 2024
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete Paper • 2502.21257 • Published Feb 28 • 2
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis Paper • 2506.04217 • Published Jun 4 • 1
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation Paper • 2506.18088 • Published Jun 22 • 18
DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation Paper • 2411.18562 • Published Nov 27, 2024
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Paper • 2504.13059 • Published Apr 17
HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents Paper • 2508.02629 • Published Aug 4 • 5
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies Paper • 2508.20072 • Published Aug 27 • 31
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning Paper • 2509.09674 • Published Sep 11 • 79