metadata
base_model:
- mistralai/Mistral-Nemo-Base-2407
new_version: allura-org/Koto-22B-PT
DO NOT USE THIS MODEL. DO NOT QUANT THIS MODEL. THE RELEASE VERSION IS PROBABLY BETTER
initial version of koto trained on an earlier version of the dataset
has a slightly different flavor than the release model. works best at ~1.15 temp and 0.01-0.02 min_p
thanks mango <3