3rd-Degree-Burn/Llama-3.1-8B-Squareroot-v0

3rd-Degree-Burn committed on Oct 10, 2024

Commit b939b84 (verified) · 1 Parent(s): 9e12ab6

Update README.md

Files changed (1)

README.md +13 -1

@@ -11,7 +11,19 @@ tags:
 # Llama-3.1-8B-Squareroot
-Llama-3.1-8B-Squareroot is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
+This is a TIES merge that combines the performance of the following models:
 * [NousResearch/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3.1-8B-Instruct)
 * [EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta](https://huggingface.co/EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta)
 * [nvidia/OpenMath2-Llama3.1-8B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6479f6dbed75e95d3e97bb4d/LpWI-ug9WZdpcrjBy44iw.png)
+# Description
+I observed that when a model is trained to do just math, it does badly on everything else. So my plan was to merge a “math” model with a strong reasoning/inference model and a general instruction-following model. The result should be a model that's steerable (able to follow instructions) and still good at math.
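
For readers new to TIES merging, here is a toy sketch of the general idea on plain Python lists: compute each model's difference from a shared base, keep only the largest changes, pick a sign per parameter, and average the changes that agree with it. This is not the configuration used to build this model. The function names, the `density` and `scale` values, and the tiny example vectors are all illustrative assumptions, and a real merge would run on full model tensors through mergekit.

```python
def trim(task_vector, density):
    """Keep the largest-magnitude `density` fraction of entries; zero the rest."""
    k = max(1, round(len(task_vector) * density))
    by_magnitude = sorted(
        range(len(task_vector)), key=lambda i: abs(task_vector[i]), reverse=True
    )
    keep = set(by_magnitude[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(task_vector)]


def ties_merge(base, finetuned_models, density=0.5, scale=1.0):
    """Toy TIES merge (hypothetical helper, not mergekit's API)."""
    # 1. Task vectors: how far each fine-tuned model moved from the base.
    task_vectors = [[f - b for f, b in zip(model, base)] for model in finetuned_models]
    # 2. Trim: drop the small changes in each task vector.
    trimmed = [trim(tv, density) for tv in task_vectors]

    merged = []
    for i, b in enumerate(base):
        column = [tv[i] for tv in trimmed]
        # 3. Elect a sign: the direction with the larger total magnitude wins.
        sign = 1.0 if sum(column) >= 0 else -1.0
        # 4. Disjoint mean: average only the nonzero entries that agree with it.
        agreeing = [v for v in column if v * sign > 0]
        delta = sum(agreeing) / len(agreeing) if agreeing else 0.0
        merged.append(b + scale * delta)
    return merged


# Made-up 4-parameter "models" sharing an all-zero base.
base = [0.0, 0.0, 0.0, 0.0]
model_a = [1.0, -0.5, 0.25, 0.0]
model_b = [0.5, 1.0, -0.125, 0.0]
print(ties_merge(base, [model_a, model_b], density=0.5))
# prints [0.75, 1.0, 0.0, 0.0]
```

In the second parameter the two models disagree on direction (`-0.5` vs `1.0`); the elected sign is positive, so only `1.0` survives instead of the two changes partly cancelling out.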
+# Examples
+# Benchmarks
+Coming very soon!