GarroshIcecream's Collections

READ ON TOILET

updated Oct 3

  • Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

    Paper • 2508.09834 • Published Aug 13 • 53

  • The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

    Paper • 2509.02547 • Published Sep 2 • 224

  • DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

    Paper • 2509.25454 • Published Sep 29 • 138

  • DeMo: Decoupled Momentum Optimization

    Paper • 2411.19870 • Published Nov 29, 2024 • 6