What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395?
9
#14 opened about 2 months ago by akierum
Can we create a ..."GLM-4.6-Distill-GLM-4.5-Air-GGUF"?
3
#13 opened about 2 months ago by NKLAR5
model has unused tensor on UD-IQ2_M: Is it normal?
👍
1
#12 opened 2 months ago by engrtipusultan
Corrected Jinja template with tool support; works with PR llama.cpp/pull/15186
❤️
2
16
#9 opened 4 months ago by xbruce22
Fixed 🏆 GLM Tool calling support in llama.cpp, raised PR
👀
1
4
#8 opened 4 months ago by xbruce22
Smashed 💪 Scored 82.86 🔥 with 2-bit IQ2_M on MMLU Pro single-shot benchmark
❤️
🔥
2
5
#7 opened 4 months ago by xbruce22
Scored 72.86 with 2-bit IQ2_M on MMLU Pro single-shot (reasoning enabled)
❤️
🔥
1
1
#6 opened 4 months ago by xbruce22
Error in ollama
👍
1
#5 opened 4 months ago by Sam1989
llama.cpp\src\llama-kv-cache-unified.cpp:226: GGML_ASSERT(seq_id >= 0 && (size_t) seq_id < seq_to_stream.size()) failed
2
#4 opened 4 months ago by devold
Missing GLM-4.5-Air-UD-IQ3_XXS.gguf
➕
3
#3 opened 4 months ago by BVEsun
unused tensors?
➕
1
2
#2 opened 4 months ago by jacek2024
Tool Calls Not Working with --jinja Option
11
#1 opened 4 months ago by TNohSam