Mingzhe Du's picture

Mingzhe Du PRO

Elfsong

·

https://mingzhe.space

Elfsong

AI & ML interests

Code Generation / Preference Alignment / Bias Mitigation

Recent Activity

upvoted a paper about 16 hours ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

upvoted a paper about 1 month ago

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

upvoted a paper about 1 month ago

ExGRPO: Learning to Reason from Experience

View all activity

Organizations

Papers 7

arXiv:2507.12415

arXiv:2505.23387

arXiv:2505.11049

arXiv:2503.01295

spaces 10

Dixit Bench

Dixit Bench

Monolith

Sandbox for Code Generation

CodeArena

CodeArena

Argus

NUSBUS

Check bus and train arrival times for NUS and nearby stations

Lucky Reactor

Manage server activation and termination

models 34

Elfsong/Llama-3.1-8B-Instruct-QG-SFT-Model

Text Generation • 8B • Updated Sep 23 • 13

Elfsong/Afterburner_3B_100

3B • Updated May 5

Elfsong/Afterburner_3B_120

3B • Updated May 5 • 1

Elfsong/Qwen2.5-Coder-3B-Instruct-Venus-Cold-Start

Text Generation • 3B • Updated Apr 29 • 2

Elfsong/qwen_3b_sft_dpo_batch_2_ga_8_lr_4e5_checkpoint_1200

Text Generation • 3B • Updated Apr 13 • 2

Elfsong/Qwen2.5-3B-DPO-Batch-8-LR-4e-5

Text Generation • 3B • Updated Apr 12

Elfsong/Qwen2.5-3B-SFT-Batch-8-LR-3e-5

Text Generation • 3B • Updated Apr 12

Elfsong/Qwen2.5-3B-Instruct-DPO-Venus

Text Generation • 3B • Updated Apr 11 • 2

Elfsong/Qwen2.5-Coder-7B-Instruct-GRPO-test

Elfsong/Qwen2.5-Coder-14B-Instruct-GRPO-test

datasets 79

Elfsong/swe-perf-reasoning

Viewer • Updated Sep 5 • 292 • 76 • 1

Elfsong/swe-perf

Updated Aug 26 • 4

Elfsong/cisco_cr

Viewer • Updated Aug 25 • 104 • 2

Elfsong/Venus

Viewer • Updated Aug 18 • 9.3k • 109 • 6

Elfsong/new_cisco_cr_uc3

Viewer • Updated Aug 11 • 39 • 9

Elfsong/Venus_Anotation

Viewer • Updated Jul 28 • 12 • 41

Elfsong/cisco-cr-changes

Viewer • Updated Jul 23 • 52 • 4

Elfsong/cisco-cr

Viewer • Updated Jul 21 • 39 • 10

Elfsong/Venus_Python

Viewer • Updated Apr 16 • 1.28k • 47

Elfsong/Venus_Python_GRPO_Reasoning_Cold_Start

Viewer • Updated Apr 14 • 898 • 7

View 79 datasets