Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published 6 days ago • 186
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph Paper • 2511.00086 • Published 14 days ago • 40
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published 19 days ago • 81
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published 16 days ago • 95
InteractComp: Evaluating Search Agents With Ambiguous Queries Paper • 2510.24668 • Published 15 days ago • 96
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published 15 days ago • 65
ReCode: Unify Plan and Action for Universal Granularity Control Paper • 2510.23564 • Published 16 days ago • 119
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published 16 days ago • 172
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 27 days ago • 47
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published 19 days ago • 94
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Paper • 2510.20579 • Published 20 days ago • 54
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 22 days ago • 59
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 22 days ago • 111
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published 22 days ago • 82