NVIDIA-Nemotron-Nano-9B-v2 with Docker

#27

by MOHASOFT - opened Sep 23

Sep 23

This is a simple repository to launch the nvidia/NVIDIA-Nemotron-Nano-9B-v2 model using vLLM and Docker. A GPU with VRAM greater than 24GB (e.g., NVIDIA RTX 3090) is recommended.

This repository provides a nemo.sh script to launch a vLLM OpenAI-compatible server for the nvidia/NVIDIA-Nemotron-Nano-9B-v2 model:

https://github.com/comewelcome/nemotron

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment