Please can I get an MLX version?
👍
1
1
#11 opened 5 months ago by bulk52
Strange, why is Q3K_XL even smaller than Q3K_M?
2
#10 opened 6 months ago by X5R
How to run the 128k models
6
#7 opened 6 months ago by rogerooberg
How can I change the number of experts for inference?
🧠
1
#5 opened 7 months ago by win10
Tool calling does not seem to be supported
2
#4 opened 7 months ago by bingw5
Umm, another bump on the road? :/
2
#2 opened 7 months ago by MrDevolver
How do I extend a Qwen3 model that has been pulled by Ollama using the YaRN method?
2
#1 opened 7 months ago by MikeNate