arXiv:2507.12415
Mingzhe Du PRO
Elfsong
AI & ML interests
Code Generation / Preference Alignment / Bias Mitigation
Recent Activity
upvoted
a
paper
about 16 hours ago
Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps
via Uncertainty Heads
upvoted
a
paper
about 1 month ago
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning
Systems in LLMs
upvoted
a
paper
about 1 month ago
ExGRPO: Learning to Reason from Experience