arXiv:2509.25154
Dawei Li
wjldw
AI & ML interests
LLM, NLP, Data Mining
Recent Activity
upvoted
a
paper
3 days ago
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
upvoted
a
paper
about 1 month ago
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified
Rewards in World Simulators
authored
a paper
about 1 month ago
DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for
Efficient Large Reasoning Models