39 34 30

Shizhe Diao

shizhediao2

https://shizhediao.github.io/

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper about 1 hour ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

liked a model 2 days ago

nvidia/Nemotron-Flash-1B

updated a dataset 23 days ago

nvidia/ToolScale

View all activity

Organizations

upvoted a paper about 1 hour ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published about 10 hours ago • 46

liked a model 2 days ago

nvidia/Nemotron-Flash-1B

Text Generation • 1.0B • Updated Nov 28, 2025 • 647 • 26

updated a dataset 23 days ago

nvidia/ToolScale

Viewer • Updated 23 days ago • 4.06k • 1.37k • 170

New activity in nvidia/ToolScale 23 days ago

Add metadata and refactor to ToolScale Dataset Card

#3 opened about 1 month ago by

nielsr

posted an update about 1 month ago

Post

151

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

reacted to di-zhang-fdu's post with 🔥 about 1 month ago

Post

1944

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

upvoted 2 papers about 1 month ago

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 32

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 114

New activity in nvidia/Nemotron-Orchestrator-8B about 1 month ago

Adding `transformers` as the library name

#18 opened about 1 month ago by

ariG23498

liked a dataset about 1 month ago

nvidia/ToolScale

Viewer • Updated 23 days ago • 4.06k • 1.37k • 170

published a dataset about 1 month ago

nvidia/ToolScale

Viewer • Updated 23 days ago • 4.06k • 1.37k • 170

liked a model about 1 month ago

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated Dec 2, 2025 • 58.6k • 482

published a model about 1 month ago

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated Dec 2, 2025 • 58.6k • 482

updated a model about 1 month ago

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated Dec 2, 2025 • 58.6k • 482

New activity in nvidia/Nemotron-Orchestrator-8B about 1 month ago

Upload merges.txt with huggingface_hub

#1 opened about 1 month ago by

bestluck123

Upload config.json with huggingface_hub

#2 opened about 1 month ago by

bestluck123

Upload model-00006-of-00007.safetensors with huggingface_hub

#3 opened about 1 month ago by

bestluck123

Upload model-00003-of-00007.safetensors with huggingface_hub

#4 opened about 1 month ago by

bestluck123

Upload model-00001-of-00007.safetensors with huggingface_hub

#5 opened about 1 month ago by

bestluck123

Upload special_tokens_map.json with huggingface_hub

#6 opened about 1 month ago by

bestluck123

Shizhe Diao

AI & ML interests

Recent Activity

Organizations

shizhediao2's activity

Add metadata and refactor to ToolScale Dataset Card

Adding `transformers` as the library name

Upload merges.txt with huggingface_hub

Upload config.json with huggingface_hub

Upload model-00006-of-00007.safetensors with huggingface_hub

Upload model-00003-of-00007.safetensors with huggingface_hub

Upload model-00001-of-00007.safetensors with huggingface_hub

Upload special_tokens_map.json with huggingface_hub