nvidia/Nemotron-Orchestrator-8B

Text Generation

text-generation-inference


shizhediao2 committed 15 days ago

Commit 0699dc2 · 1 Parent(s): e3d777f

update README

Files changed (1)

README.md +4 -4

@@ -11,14 +11,14 @@
 Orchestrator-8B is a state-of-the-art 8B parameter orchestration model designed to solve complex, multi-turn agentic tasks by coordinating a diverse set of expert models and tools.
 <p align="center">
-    <img src="./assets/method.png" width="100%"/>
+    <img src="https://raw.githubusercontent.com/NVlabs/ToolOrchestra/main/assets/method.png" width="100%"/>
 <p>
 On the Humanity's Last Exam (HLE) benchmark, ToolOrchestrator-8B achieves a score of 37.1%, outperforming GPT-5 (35.1%) while being approximately 2.5x more efficient.
 <p align="center">
-    <img src="./assets/HLE_benchmark.png" width="80%"/>
+    <img src="https://raw.githubusercontent.com/NVlabs/ToolOrchestra/main/assets/HLE_benchmark.png" width="80%"/>
 <p>
 This model is for research and development only.
@@ -35,12 +35,12 @@ This model is for research and development only.
 On Humanity’s Last Exam, Orchestrator-8B achieves 37.1%, surpassing GPT-5 (35.1%) with only 30% monetary cost and 2.5x faster. On FRAMES and τ²-Bench, Orchestrator-8B consistently outperforms strong monolithic systems, demonstrating versatile reasoning and robust tool orchestration.
 <p align="center">
-    <img src="./assets/results.png" width="100%"/>
+    <img src="https://raw.githubusercontent.com/NVlabs/ToolOrchestra/main/assets/results.png" width="100%"/>
 <p>
 Orchestrator-8B consistently outperforms GPT-5, Claude Opus 4.1 and Qwen3-235B-A22B on HLE with substantially lower cost.
 <p align="center">
-    <img src="./assets/cost-performance.png" width="100%"/>
+    <img src="https://raw.githubusercontent.com/NVlabs/ToolOrchestra/main/assets/cost_performance.png" width="100%"/>
 <p>