The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
published
a model
4 days ago
TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280
updated
a model
4 days ago
TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280
upvoted
a
paper
about 1 month ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning