Kaiyan Zhang's picture

Kaiyan Zhang

iseesaw

·

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a collection about 1 month ago

View all activity

Organizations

Papers 27

arXiv:2509.15207

arXiv:2509.09674

arXiv:2509.08827

arXiv:2509.04419

models 0

None public yet

datasets 0

None public yet