Collections
Discover the best community collections!
Collections including paper arxiv:2508.03680
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 260 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 220 -
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 113 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100
-
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Paper • 2508.05748 • Published • 137 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100 -
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper • 2506.07491 • Published • 50 -
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos
Paper • 2508.14041 • Published • 59
-
Reasoning Language Model Inference Serving Unveiled: An Empirical Study
Paper • 2510.18672 • Published • 7 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100 -
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation
Paper • 2510.22115 • Published • 73
-
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Paper • 2508.05748 • Published • 137 -
TTT3R: 3D Reconstruction as Test-Time Training
Paper • 2509.26645 • Published • 14 -
Human3R: Everyone Everywhere All at Once
Paper • 2510.06219 • Published • 9 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 464
-
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning
Paper • 2504.08600 • Published • 31 -
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
Paper • 2509.21710 • Published • 17 -
TTRL: Test-Time Reinforcement Learning
Paper • 2504.16084 • Published • 120 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 189 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 108 -
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 94 -
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
Paper • 2507.06229 • Published • 75
-
Reasoning Language Model Inference Serving Unveiled: An Empirical Study
Paper • 2510.18672 • Published • 7 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100 -
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation
Paper • 2510.22115 • Published • 73
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 260 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 220 -
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 113 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100
-
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Paper • 2508.05748 • Published • 137 -
TTT3R: 3D Reconstruction as Test-Time Training
Paper • 2509.26645 • Published • 14 -
Human3R: Everyone Everywhere All at Once
Paper • 2510.06219 • Published • 9 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 464
-
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Paper • 2508.05748 • Published • 137 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100 -
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper • 2506.07491 • Published • 50 -
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos
Paper • 2508.14041 • Published • 59
-
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning
Paper • 2504.08600 • Published • 31 -
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
Paper • 2509.21710 • Published • 17 -
TTRL: Test-Time Reinforcement Learning
Paper • 2504.16084 • Published • 120 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 100
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 189 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 108 -
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 94 -
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
Paper • 2507.06229 • Published • 75