Update README.md
README.md CHANGED
@@ -11,7 +11,7 @@ I am currently in the process of cleaning up the code before publishing it, much
 
 ## Final merge composition
 
-After processing 12 models my algorithm ended up with the following (approximated) final composition
+After processing 12 models my algorithm ended up with the following (approximated) final composition:
 
 | Model                    | Contribution |
 |--------------------------|--------------|
@@ -28,6 +28,8 @@ After processing 12 models my algorithm ended up with the following (approximate
 | Mistral-7B-v0.1          | 2%           |
 | Openchat_3.5             | 2%           |
 
+There is no real logic in how these models were divided throughout the merge: small bits and pieces were taken from each and then mixed in with other models on a layer-by-layer basis, using a pattern similar to my MythoMax recipe, in which underlying tensors are mixed in a criss-cross manner.
+
 This new process only decides on the model's layers, not the singular lm_head and embed_tokens layers, which influence much of the model's output. I ran a separate script for that, picking the singular tensors that create the longest responses, which settled on Toppy-M-7B.
 
 ## Prompt Format
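For readers unfamiliar with layer-wise merging, the criss-cross mixing described in the added paragraph can be pictured as a weighted blend in which each donor model's share changes from one layer to the next. The following PyTorch sketch is an illustration only, not the author's unpublished script; the function, the weight values, and the usage lines are hypothetical, and it assumes all donors share the Mistral-7B tensor layout.

```python
# Illustration only: a per-layer weighted blend, not the author's script.
# Assumes every donor checkpoint shares the Mistral-7B tensor layout;
# the weight values used per layer are hypothetical.
import torch

def blend_layer(donor_state_dicts, weights, layer_idx):
    """Blend one transformer layer from several donor models.

    donor_state_dicts: state dicts with identical keys and shapes.
    weights: blend ratios for this layer, summing to 1.0.
    """
    prefix = f"model.layers.{layer_idx}."
    merged = {}
    for name in donor_state_dicts[0]:
        if not name.startswith(prefix):
            continue
        # Varying the weights from layer to layer (and donor to donor)
        # is what produces the criss-cross pattern of contributions.
        merged[name] = sum(
            w * sd[name].to(torch.float32)
            for w, sd in zip(weights, donor_state_dicts)
        )
    return merged

# Hypothetical usage: different ratios at every layer of the stack.
# layer0 = blend_layer([sd_a, sd_b, sd_c], [0.5, 0.3, 0.2], 0)
# layer1 = blend_layer([sd_a, sd_b, sd_c], [0.1, 0.6, 0.3], 1)
```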
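The lm_head/embed_tokens selection script is likewise unpublished. A plausible sketch of the "longest responses" criterion, assuming Hugging Face transformers; the prompt list and the `candidates` mapping are made up:

```python
# Illustration only: the selection script is unpublished. This sketch
# swaps each candidate's two singular tensors into a base model, generates
# from a fixed prompt set, and keeps the candidate that writes the most.
# The prompt list and the `candidates` mapping are made up.
import torch

PROMPTS = [
    "Write a short story about a lighthouse keeper.",
    "Describe your ideal vacation in detail.",
]

def generated_length(model, tokenizer, prompts):
    """Total number of new tokens produced across all prompts."""
    total = 0
    for prompt in prompts:
        inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
        output = model.generate(**inputs, max_new_tokens=512, do_sample=False)
        total += output.shape[-1] - inputs["input_ids"].shape[-1]
    return total

def pick_head(model, tokenizer, candidates):
    """candidates: model name -> dict holding that model's embed_tokens
    and lm_head weights. Returns the name whose tensors yield the
    longest responses."""
    best_name, best_len = None, -1
    for name, tensors in candidates.items():
        with torch.no_grad():
            model.get_input_embeddings().weight.copy_(
                tensors["model.embed_tokens.weight"])
            model.get_output_embeddings().weight.copy_(
                tensors["lm_head.weight"])
        length = generated_length(model, tokenizer, PROMPTS)
        if length > best_len:
            best_name, best_len = name, length
    return best_name
```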