nightmedia
/

Qwen3-30B-A3B-YOYO-V4-qx86x-hi-mlx

Text Generation

8-bit precision

Model card Files Files and versions

nightmedia commited on 15 days ago

Commit

7dfd663

·

verified ·

1 Parent(s): b166fc6

Update README.md

Files changed (1) hide show

README.md +1 -3

README.md CHANGED Viewed

@@ -16,9 +16,7 @@ library_name: mlx
 Hi Spock!
 We are going to analyze the cognitive abilities of a few quantizations of this model
-The Deckard(qx) quants are in a mixed precision quantization:
-- qx64x has data at 4 bit, while the attention paths, head, and embeddings are at 6 bit
-- qx86x has data at 6 bit, while the attention paths, head, and embeddings are at 8 bit
 The Deckard formula was inspired from my Nikon Noct Z 58mm F/0.95 for its human-like rendering, sharp details, thin depth of field, and pattern-rich background blur that humans find pleasing. In interaction, these models have a specific character that associated the name, quite often reaching out to metaphors. I used this idea in the transformer layer design, by adding enhanced attention paths in high bit size every four layers, additionally to setting the heads and embeddings to high bit.

 Hi Spock!
 We are going to analyze the cognitive abilities of a few quantizations of this model
+The Deckard(qx) quants are in a mixed precision quantization, with data at 6 bit, while the attention paths, head, and embeddings are at 8 bit
 The Deckard formula was inspired from my Nikon Noct Z 58mm F/0.95 for its human-like rendering, sharp details, thin depth of field, and pattern-rich background blur that humans find pleasing. In interaction, these models have a specific character that associated the name, quite often reaching out to metaphors. I used this idea in the transformer layer design, by adding enhanced attention paths in high bit size every four layers, additionally to setting the heads and embeddings to high bit.