Running 3.46k 3.46k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13 • 26.3k • • 2.06k
taylorbollman/bertnomic_2048_forGLUE_test Feature Extraction • 0.1B • Updated Mar 26, 2024 • 13