Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

unsloth
/
GLM-4.5-Air-GGUF

Text Generation
Transformers
GGUF
English
Chinese
unsloth
imatrix
conversational
Model card Files Files and versions
xet
Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395

9
#14 opened about 2 months ago by
akierum

Can we create a ..."GLM-4.6-Distill-GLM-4.5-Air-GGUF"?

3
#13 opened about 2 months ago by
NKLAR5

model has unused tensor on UD-IQ2_M: Is it normal?

👍 1
#12 opened 2 months ago by
engrtipusultan

parts10

#11 opened 4 months ago by
rakmik

Corrected jinja template with tool Support works with PR llama.cpp/pull/15186

❤️ 2
16
#9 opened 4 months ago by
xbruce22

Fixed 🏆 GLM Tool calling support in llama.cpp, raised PR

👀 1
4
#8 opened 4 months ago by
xbruce22

Smashed 💪 Scored to 82.86 🔥2bit IQ2_M on MMLU Pro single shot benchmark

❤️ 🔥 2
5
#7 opened 4 months ago by
xbruce22

Scored 72.86 2bit IQ2_M on MMLU Pro single shot (reasoning enabled)

❤️ 🔥 1
1
#6 opened 4 months ago by
xbruce22

Error in ollama

👍 1
#5 opened 4 months ago by
Sam1989

llama.cpp\src\llama-kv-cache-unified.cpp:226: GGML_ASSERT(seq_id >= 0 && (size_t) seq_id < seq_to_stream.size()) failed

2
#4 opened 4 months ago by
devold

Missing GLM-4.5-Air-UD-IQ3_XXS.gguf

➕ 3
#3 opened 4 months ago by
BVEsun

unused tensors?

➕ 1
2
#2 opened 4 months ago by
jacek2024

Tool Calls Not Working with –jinja Option

11
#1 opened 4 months ago by
TNohSam
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs