Mirrored mergekit-ready models
Collection
Mirrored models tweaked to be more friendly for mergekit. No pickles allowed.
•
10 items
•
Updated
•
1
This is a merge of pre-trained language models created using mergekit.
Excess lm_head.weight tensor weights have been trimmed away from the weights at lemon07r/Gemma-2-Ataraxy-v4c-9B.
This model was merged using the SLERP merge method.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
base_model: zelk12/recoilme-gemma-2-Ataraxy-9B-v0.1-t0.25
dtype: bfloat16
merge_method: slerp
parameters:
t: 0.25
slices:
- sources:
- layer_range: [0, 42]
model: zelk12/recoilme-gemma-2-Ataraxy-9B-v0.1-t0.25
- layer_range: [0, 42]
model: lemon07r/Gemma-2-Ataraxy-v3b-9B
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 32.63 |
| IFEval (0-Shot) | 69.45 |
| BBH (3-Shot) | 44.13 |
| MATH Lvl 5 (4-Shot) | 17.98 |
| GPQA (0-shot) | 11.19 |
| MuSR (0-shot) | 15.30 |
| MMLU-PRO (5-shot) | 37.72 |