PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning Paper • 2508.21104 • Published Aug 28 • 35
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables Paper • 2508.19813 • Published Aug 27 • 25
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes Paper • 2508.19060 • Published Aug 26 • 10
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents Paper • 2508.17198 • Published Aug 24 • 9
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench Paper • 2508.20931 • Published Aug 28 • 15
Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities Paper • 2508.19562 • Published Aug 27 • 2