unsloth
/

GLM-4.5-Air-GGUF

Text Generation

Model card Files Files and versions

Resources

View closed (1)

What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395

#14 opened about 2 months ago by

Can we create a ..."GLM-4.6-Distill-GLM-4.5-Air-GGUF"?

#13 opened about 2 months ago by

model has unused tensor on UD-IQ2_M: Is it normal?

#12 opened 2 months ago by

parts10

#11 opened 4 months ago by

Corrected jinja template with tool Support works with PR llama.cpp/pull/15186

#9 opened 4 months ago by

Fixed 🏆 GLM Tool calling support in llama.cpp, raised PR

#8 opened 4 months ago by

Smashed 💪 Scored to 82.86 🔥2bit IQ2_M on MMLU Pro single shot benchmark

#7 opened 4 months ago by

Scored 72.86 2bit IQ2_M on MMLU Pro single shot (reasoning enabled)

#6 opened 4 months ago by

Error in ollama

#5 opened 4 months ago by

llama.cpp\src\llama-kv-cache-unified.cpp:226: GGML_ASSERT(seq_id >= 0 && (size_t) seq_id < seq_to_stream.size()) failed

#4 opened 4 months ago by

Missing GLM-4.5-Air-UD-IQ3_XXS.gguf

#3 opened 4 months ago by

unused tensors?

#2 opened 4 months ago by

Tool Calls Not Working with –jinja Option

#1 opened 4 months ago by