zzfive
			's Collections
			 
		
			
		robot
		
	updated
			
 
				
				
	
	
	
			
			GRUtopia: Dream General Robots in a City at Scale
		
			Paper
			
•
			2407.10943
			
•
			Published
				
			•
				
				25
			
 
	
	 
	
	
	
			
			Make-An-Agent: A Generalizable Policy Network Generator with
  Behavior-Prompted Diffusion
		
			Paper
			
•
			2407.10973
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Cross Anything: General Quadruped Robot Navigation through Complex
  Terrains
		
			Paper
			
•
			2407.16412
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual
  Dexterous Robot Hands
		
			Paper
			
•
			2408.11048
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			LLM-3D Print: Large Language Models To Monitor and Control 3D Printing
		
			Paper
			
•
			2408.14307
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			In-Context Imitation Learning via Next-Token Prediction
		
			Paper
			
•
			2408.15980
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Diffusion Policy Policy Optimization
		
			Paper
			
•
			2409.00588
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			Affordance-based Robot Manipulation with Flow Matching
		
			Paper
			
•
			2409.01083
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
		
			Paper
			
•
			2409.12192
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Robot See Robot Do: Imitating Articulated Object Manipulation with
  Monocular 4D Reconstruction
		
			Paper
			
•
			2409.18121
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video
  Even in VLMs
		
			Paper
			
•
			2410.16267
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			Data Scaling Laws in Imitation Learning for Robotic Manipulation
		
			Paper
			
•
			2410.18647
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			Neural Fields in Robotics: A Survey
		
			Paper
			
•
			2410.20220
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Robots Pre-train Robots: Manipulation-Centric Robotic Representation
  from Large-Scale Robot Dataset
		
			Paper
			
•
			2410.22325
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			IGOR: Image-GOal Representations are the Atomic Control Units for
  Foundation Models in Embodied AI
		
			Paper
			
•
			2411.00785
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for
  Efficient Robot Execution
		
			Paper
			
•
			2411.02359
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			WildLMa: Long Horizon Loco-Manipulation in the Wild
		
			Paper
			
•
			2411.15131
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			GRAPE: Generalizing Robot Policy via Preference Alignment
		
			Paper
			
•
			2411.19309
			
•
			Published
				
			•
				
				47
			
 
	
	 
	
	
	
			
			Code-as-Monitor: Constraint-aware Visual Programming for Reactive and
  Proactive Robotic Failure Detection
		
			Paper
			
•
			2412.04455
			
•
			Published
				
			•
				
				38
			
 
	
	 
	
	
	
			
			Moto: Latent Motion Token as the Bridging Language for Robot
  Manipulation
		
			Paper
			
•
			2412.04445
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			Emma-X: An Embodied Multimodal Action Model with Grounded Chain of
  Thought and Look-ahead Spatial Reasoning
		
			Paper
			
•
			2412.11974
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot
  Learning
		
			Paper
			
•
			2412.10447
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
		
			Paper
			
•
			2412.09858
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			Efficient Diffusion Transformer Policies with Mixture of Expert
  Denoisers for Multitask Learning
		
			Paper
			
•
			2412.12953
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Prompting Depth Anything for 4K Resolution Accurate Metric Depth
  Estimation
		
			Paper
			
•
			2412.14015
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Learning from Massive Human Videos for Universal Humanoid Pose Control
		
			Paper
			
•
			2412.14172
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
		
			Paper
			
•
			2501.01895
			
•
			Published
				
			•
				
				55
			
 
	
	 
	
	
	
			
			OmniManip: Towards General Robotic Manipulation via Object-Centric
  Interaction Primitives as Spatial Constraints
		
			Paper
			
•
			2501.03841
			
•
			Published
				
			•
				
				56
			
 
	
	 
	
	
	
			
			Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous
  Sensors via Language Grounding
		
			Paper
			
•
			2501.04693
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			FAST: Efficient Action Tokenization for Vision-Language-Action Models
		
			Paper
			
•
			2501.09747
			
•
			Published
				
			•
				
				27
			
 
	
	 
	
	
	
			
			Embodied Red Teaming for Auditing Robotic Foundation Models
		
			Paper
			
•
			2411.18676
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			Learning Getting-Up Policies for Real-World Humanoid Robots
		
			Paper
			
•
			2502.12152
			
•
			Published
				
			•
				
				42
			
 
	
	 
	
	
	
			
			A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
		
			Paper
			
•
			2503.06960
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			Being-0: A Humanoid Robotic Agent with Vision-Language Models and
  Modular Skills
		
			Paper
			
•
			2503.12533
			
•
			Published
				
			•
				
				68
			
 
	
	 
	
	
	
			
			Free-form language-based robotic reasoning and grasping
		
			Paper
			
•
			2503.13082
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
		
			Paper
			
•
			2503.15558
			
•
			Published
				
			•
				
				50
			
 
	
	 
	
	
	
			
			Dita: Scaling Diffusion Transformer for Generalist
  Vision-Language-Action Policy
		
			Paper
			
•
			2503.19757
			
•
			Published
				
			•
				
				51
			
 
	
	 
	
	
	
			
			Gemini Robotics: Bringing AI into the Physical World
		
			Paper
			
•
			2503.20020
			
•
			Published
				
			•
				
				29
			
 
	
	 
	
	
	
			
			Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for
  Embodied Interactive Tasks
		
			Paper
			
•
			2503.21696
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			NORA: A Small Open-Sourced Generalist Vision Language Action Model for
  Embodied Tasks
		
			Paper
			
•
			2504.19854
			
•
			Published
				
			•
				
				7
			
 
	
	 
	
	
	
			
			EnerVerse-AC: Envisioning Embodied Environments with Action Condition
		
			Paper
			
•
			2505.09723
			
•
			Published
				
			•
				
				23
			
 
	
	 
	
	
	
			
			SmolVLA: A Vision-Language-Action Model for Affordable and Efficient
  Robotics
		
			Paper
			
•
			2506.01844
			
•
			Published
				
			•
				
				140
			
 
	
	 
	
	
	
			
			Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in
  Robotics
		
			Paper
			
•
			2506.00070
			
•
			Published
				
			•
				
				29
			
 
	
	 
	
	
	
			
			Ark: An Open-source Python-based Framework for Robot Learning
		
			Paper
			
•
			2506.21628
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			RoboScape: Physics-informed Embodied World Model
		
			Paper
			
•
			2506.23135
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			RoboBrain 2.0 Technical Report
		
			Paper
			
•
			2507.02029
			
•
			Published
				
			•
				
				32
			
 
	
	 
	
	
	
		
			Paper
			
•
			2507.15493
			
•
			Published
				
			•
				
				47
			
 
	
	 
	
	
	
			
			Experience is the Best Teacher: Grounding VLMs for Robotics through
  Self-Generated Memory
		
			Paper
			
•
			2507.16713
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks
		
			Paper
			
•
			2508.05614
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			Embodied-R1: Reinforced Embodied Reasoning for General Robotic
  Manipulation
		
			Paper
			
•
			2508.13998
			
•
			Published
				
			•
				
				18
			
 
	
	 
	
	
	
			
			RynnEC: Bringing MLLMs into Embodied World
		
			Paper
			
•
			2508.14160
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for
  Long-Horizon Tasks
		
			Paper
			
•
			2508.08240
			
•
			Published
				
			•
				
				45
			
 
	
	 
	
	
	
			
			EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for
  General Robot Control
		
			Paper
			
•
			2508.21112
			
•
			Published
				
			•
				
				75
			
 
	
	 
	
	
	
			
			HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data
  for Mobile Dexterous Manipulation
		
			Paper
			
•
			2508.20085
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Robix: A Unified Model for Robot Interaction, Reasoning and Planning
		
			Paper
			
•
			2509.01106
			
•
			Published
				
			•
				
				48
			
 
	
	 
	
	
	
			
			Manipulation as in Simulation: Enabling Accurate Geometry Perception in
  Robots
		
			Paper
			
•
			2509.02530
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			Nav-R1: Reasoning and Navigation in Embodied Scenes
		
			Paper
			
•
			2509.10884
			
•
			Published
				
			•
				
				6
			
 
	
	 
	
	
	
			
			OceanGym: A Benchmark Environment for Underwater Embodied Agents
		
			Paper
			
•
			2509.26536
			
•
			Published
				
			•
				
				34
			
 
	
	 
	
	
	
			
			Robot Learning: A Tutorial
		
			Paper
			
•
			2510.12403
			
•
			Published
				
			•
				
				99