Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph Paper • 2511.00086 • Published 11 days ago • 40
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published 15 days ago • 80
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published 13 days ago • 95
InteractComp: Evaluating Search Agents With Ambiguous Queries Paper • 2510.24668 • Published 12 days ago • 96
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published 12 days ago • 65
ReCode: Unify Plan and Action for Universal Granularity Control Paper • 2510.23564 • Published 13 days ago • 118
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published 13 days ago • 172
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 24 days ago • 45
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published 16 days ago • 92
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Paper • 2510.20579 • Published 17 days ago • 54
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 18 days ago • 59
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 18 days ago • 110
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published 19 days ago • 82
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 20 days ago • 117
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 19 days ago • 107
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper • 2510.15444 • Published 23 days ago • 145