Fang Wu

fangwu97

https://smiles724.github.io/

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

HuggingFaceTB/smol-training-playbook

upvoted a paper 9 days ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

commented on a paper 9 days ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

Organizations

commented on a paper 9 days ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

Paper • 2510.20976 • Published 16 days ago • 2

New activity in fangwu97/DeepSearch-1.5B 20 days ago

Could you share the training code?

#2 opened about 1 month ago by

New activity in fangwu97/DeepSearch-1.5B about 1 month ago

Add pipeline tag and hyperlink paper in model card

#1 opened about 1 month ago by

commented on 2 papers about 1 month ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 136

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27 • 61

commented on 4 papers 4 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20 • 84

commented on a paper 5 months ago

When to Trust Context: Self-Reflective Debates for Context Reliability

Paper • 2506.06020 • Published Jun 6 • 1