In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published 30 days ago • 97 • 3
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute Paper • 2506.15882 • Published Jun 18 • 2
Where LLM Agents Fail and How They can Learn From Failures Paper • 2509.25370 • Published Sep 29 • 11
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Paper • 2509.22646 • Published Sep 26 • 16
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Paper • 2510.06217 • Published 29 days ago • 62
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published 30 days ago • 97
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published 30 days ago • 97
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Paper • 2510.06217 • Published 29 days ago • 62
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 136
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Paper • 2509.22646 • Published Sep 26 • 16