Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
charles's picture
4

charles

Aira666

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
upvoted a paper 27 days ago
Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony
upvoted a paper 5 months ago
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
View all activity

Organizations

None yet

upvoted a paper 25 days ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published 26 days ago • 56
upvoted a paper 27 days ago

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

Paper • 2510.11345 • Published 28 days ago • 15
upvoted a paper 5 months ago

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Paper • 2506.06122 • Published Jun 6 • 7
upvoted a paper 11 months ago

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Paper • 2411.18478 • Published Nov 27, 2024 • 37
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs