marekprachar's Collections
Worthy-reads

updated Oct 2

  • Towards General Agentic Intelligence via Environment Scaling

    Paper • 2509.13311 • Published Sep 16 • 70

  • Tree Search for LLM Agent Reinforcement Learning

    Paper • 2509.21240 • Published Sep 25 • 87

  • Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

    Paper • 2509.25849 • Published Sep 30 • 47

  • Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

    Paper • 2509.26628 • Published Sep 30 • 15