DavidDeng

ZiHDeng

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago

Zihan1004/FNSPID

upvoted an article 24 days ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a paper 26 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

View all activity

Organizations

None yet

liked a dataset 10 days ago

Zihan1004/FNSPID

Preview • Updated Apr 9, 2024 • 2.45k • 89

upvoted an article 24 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

575

upvoted a paper 26 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published 30 days ago • 72

liked a dataset 8 months ago

jylins/videoxum

Viewer • Updated Apr 22, 2024 • 14k • 209 • 14

upvoted an article 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

271

liked a model 10 months ago

jingyaogong/MiniMind2

0.1B • Updated Dec 12, 2025 • 661 • 83

upvoted a paper 11 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

updated 3 models almost 2 years ago

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v6

Viewer • Updated Feb 7, 2024 • 6.21k • 3

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-0204

Updated Feb 4, 2024 • 6

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v5

Viewer • Updated Feb 4, 2024 • 1.66k • 5

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-0202

Updated Feb 2, 2024 • 5

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v4

Viewer • Updated Feb 2, 2024 • 1.66k • 9

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-MIX-2000

Updated Jan 30, 2024 • 2

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v3

Viewer • Updated Jan 30, 2024 • 8.87k • 10

updated 3 models almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-MIX

Updated Jan 30, 2024 • 6

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-FIM

Updated Jan 29, 2024 • 2

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8

Updated Jan 28, 2024 • 11

DavidDeng

AI & ML interests

Recent Activity

Organizations

ZiHDeng's activity

We Got Claude to Fine-Tune an Open Source LLM

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge