| base_model: | |
| - mistralai/Mistral-Nemo-Base-2407 | |
| new_version: allura-org/Koto-22B-PT | |
| # DO NOT USE THIS MODEL. DO NOT QUANT THIS MODEL. THE RELEASE VERSION IS PROBABLY BETTER | |
| initial version of koto trained on an earlier version of the dataset | |
| has a slightly different flavor than the release model. works best at ~1.15 temp and 0.01-0.02 min_p | |
| thanks mango <3 |