🧠SmolLM3 Collection Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9 • 84
Qwen 3 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require Ampere optimized llama.cpp available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 14 items • Updated Sep 15 • 2
Gemma 3 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require Ampere optimized llama.cpp available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 4 items • Updated Sep 12 • 1
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 11 days ago • 68
Llama 3.2 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require Ampere optimized llama.cpp available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 2 items • Updated Sep 12 • 2