Dawei Li's picture

4 45 1

Dawei Li

wjldw

·

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Recent Activity

upvoted a paper 3 days ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

upvoted a paper about 1 month ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

authored a paper about 1 month ago

DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models

View all activity

Organizations

Papers 14

arXiv:2509.25154

arXiv:2508.19570

arXiv:2508.01191

arXiv:2505.18759

models 9

wjldw/Qwen2.5-14B_gemini_sft_30000

Text Generation • 15B • Updated Jul 29 • 3

wjldw/Qwen2.5-14B_gpt4_sft_30000

Text Generation • 15B • Updated Jul 29 • 4

wjldw/bert_classifier

0.1B • Updated Jan 22

wjldw/Mistral-7B-v0.1_gemini_dpo_30000

Text Generation • 7B • Updated Jan 2

wjldw/Mistral-7B-v0.1_gpt4_dpo_30000

Text Generation • 7B • Updated Jan 2

wjldw/Mistral-7B-v0.1_llama_dpo_30000

Text Generation • 7B • Updated Jan 2

wjldw/Mistral-7B-v0.1_gemini_sft_30000

Text Generation • 7B • Updated Dec 26, 2024

wjldw/Mistral-7B-v0.1_gpt4_sft_30000

Text Generation • 7B • Updated Dec 26, 2024 • 3

wjldw/Mistral-7B-v0.1_llama_sft_30000

Text Generation • 7B • Updated Dec 26, 2024 • 1

datasets 1

wjldw/JD-Bench

Viewer • Updated Sep 29 • 42k • 38