arxiv:2505.13438
Chao Du
duchao
AI & ML interests
Generative Modeling & Trustworthy ML
Recent Activity
upvoted
a
paper
about 1 month ago
Variational Reasoning for Language Models
upvoted
a
paper
about 1 month ago
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
upvoted
a
paper
2 months ago
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Organizations
None yet