Worthy-reads - a marekprachar Collection

updated Oct 2

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16 • 70

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25 • 87

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published Sep 30 • 15