Yuxiang Zhang's picture

2 10

Yuxiang Zhang

TokerZ

·

AI & ML interests

LLM-based Agent, RL, Large Reasoning Model

Recent Activity

upvoted a paper 9 days ago

The Era of Agentic Organization: Learning to Organize with Language Models

upvoted a paper 20 days ago

DeepSeek-OCR: Contexts Optical Compression

upvoted a paper 23 days ago

Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI

View all activity

Organizations

None yet

upvoted a paper 9 days ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published 13 days ago • 24

upvoted a paper 20 days ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 23 days ago • 72

upvoted a paper 23 days ago

Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI

Paper • 2510.16720 • Published 25 days ago • 6

upvoted a paper 29 days ago

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Paper • 2510.12635 • Published 29 days ago • 15

upvoted a paper 4 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16 • 18

upvoted a paper 6 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

upvoted 3 papers 8 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 141

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 44

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published Mar 9 • 19

upvoted a paper 11 months ago

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published Dec 22, 2024 • 9