Ding

dyyyyyyyy

AI & ML interests

None yet

Recent Activity

liked a Space 22 days ago

ISEEKYAN/megatron_memory_estimator

new activity 2 months ago

dyyyyyyyy/FAPO-Critic:Add task categories, tags, paper link, and sample usage

new activity 2 months ago

dyyyyyyyy/FAPO-GenRM-4B:Improve model card: Add pipeline tag, library name, paper link, and abstract

liked a Space 22 days ago

Megatron Memory Estimator

Estimate GPU memory usage for Megatron models

New activity in dyyyyyyyy/FAPO-Critic 2 months ago

Add task categories, tags, paper link, and sample usage

#1 opened 2 months ago by nielsr

New activity in dyyyyyyyy/FAPO-GenRM-4B 2 months ago

Improve model card: Add pipeline tag, library name, paper link, and abstract

#1 opened 2 months ago by nielsr

authored a paper 2 months ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Paper • 2510.22543 • Published Oct 26, 2025 • 11

commented a paper 2 months ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Paper • 2510.22543 • Published Oct 26, 2025 • 11

updated 2 datasets 2 months ago

dyyyyyyyy/FAPO-Reasoning-Dataset

Viewer • Updated Oct 28, 2025 • 351k • 78

dyyyyyyyy/FAPO-Critic

Viewer • Updated Oct 31, 2025 • 87k • 61

updated 2 models 2 months ago

dyyyyyyyy/FAPO-32B

33B • Updated Oct 28, 2025 • 8 • 1

dyyyyyyyy/FAPO-GenRM-4B

Text Generation • 4B • Updated Oct 31, 2025 • 23 • 1

updated a collection 2 months ago

FAPO

Collection

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning. Project Page: https://fapo-rl.github.io/ • 4 items • Updated Oct 24, 2025

published a model 2 months ago

dyyyyyyyy/FAPO-32B

33B • Updated Oct 28, 2025 • 8 • 1

published a dataset 2 months ago

dyyyyyyyy/FAPO-Reasoning-Dataset

Viewer • Updated Oct 28, 2025 • 351k • 78

published a model 2 months ago

dyyyyyyyy/FAPO-GenRM-4B

Text Generation • 4B • Updated Oct 31, 2025 • 23 • 1

published a dataset 2 months ago

dyyyyyyyy/FAPO-Critic

Viewer • Updated Oct 31, 2025 • 87k • 61

upvoted a paper 3 months ago

Revisiting Long-context Modeling from Context Denoising Perspective

Paper • 2510.05862 • Published Oct 7, 2025 • 20

authored a paper 3 months ago

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning

Paper • 2509.16548 • Published Sep 20, 2025

commented a paper 3 months ago