Heng Lin

Heng1999

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 23 days ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16 • 47

commented on a paper about 1 month ago

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Paper • 2510.07242 • Published Oct 8 • 30

New activity in Heng1999/Qwen3-8B-TIR-ASPO about 1 month ago

Is the base model of Qwen3-8B-TIR-ASPO Qwen3-8B or Qwen3-8B-base?

#2 opened about 1 month ago by

upvoted a paper 2 months ago

Single-stream Policy Optimization

Paper • 2509.13232 • Published Sep 16 • 33

upvoted a collection 3 months ago

Qwen3

84 items • Updated Aug 6 • 1.44k

New activity in Heng1999/Qwen3-8B-TIR-DAPO 3 months ago

Improve model card: Add pipeline tag, library name, and paper metadata

#1 opened 3 months ago by

New activity in Heng1999/Qwen3-8B-TIR-ASPO 3 months ago

Improve model card: Add pipeline tag and library name

#1 opened 3 months ago by

New activity in Heng1999/Omni-MATH-512 3 months ago

Add task category and relevant tags to dataset card

#2 opened 3 months ago by

updated a dataset 3 months ago

Heng1999/dapo-en-10k

Viewer • Updated Sep 1 • 10k • 47

New activity in Heng1999/dapo-en-10k 3 months ago

Update dataset card: Add task category, tags, and improve description

#2 opened 3 months ago by

authored a paper 3 months ago

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26 • 32

updated a model 3 months ago

Heng1999/Qwen3-8B-TIR-DAPO

Text Generation • 8B • Updated Sep 1 • 12

updated a dataset 3 months ago

Heng1999/Omni-MATH-512

Viewer • Updated Sep 1 • 512 • 57 • 1

updated a model 3 months ago

Heng1999/Qwen3-8B-TIR-ASPO

Text Generation • 8B • Updated Sep 1 • 20

upvoted a collection 3 months ago

Understanding Tool-Integrated Reasoning

The official models and datasets for the paper "Understanding Tool-Integrated Reasoning" • 5 items • Updated Aug 27 • 2

updated a collection 3 months ago

Understanding Tool-Integrated Reasoning

The official models and datasets for the paper "Understanding Tool-Integrated Reasoning" • 5 items • Updated Aug 27 • 2

upvoted a paper 3 months ago

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26 • 32