Wenhan Ma's picture

1 3 13

Wenhan Ma

CuteNPC

·

https://github.com/CuteNPC

CuteNPC

AI & ML interests

Large Language Model

Recent Activity

authored a paper 19 days ago

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

authored a paper 5 months ago

MiMo-VL Technical Report

upvoted a paper 5 months ago

Reinforcement Pre-Training

View all activity

Organizations

None yet

authored a paper 19 days ago

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

Paper • 2510.11370 • Published Oct 13 • 2

authored a paper 5 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 80

authored a paper 6 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82