1. Open the notebook β Click βOpen in Colabβ and enable GPU mode. 2. Enter model details β Provide the Hugging Face repo name & quantization type.
* Example: unsloth/Qwen3-8B-GGUF with quant Q5_k_m 3. Run all cells β Wait 1β3 minutes. You'll get a link to the GUI & API (OpenAI-compatible).