3rd-Degree-Burn
/

Llama-3.1-8B-Squareroot-v0

NousResearch/Meta-Llama-3.1-8B-Instruct

EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta

nvidia/OpenMath2-Llama3.1-8B

Model card Files Files and versions

Llama-3.1-8B-Squareroot-v0 / README.md

3rd-Degree-Burn's picture

3rd-Degree-Burn

Update README.md

b4a1fda verified 9 days ago

|

history blame contribute delete

1.4 kB

	---
	license: apache-2.0
	tags:
	- merge
	- mergekit
	- lazymergekit
	- NousResearch/Meta-Llama-3.1-8B-Instruct
	- EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
	- nvidia/OpenMath2-Llama3.1-8B
	base_model:
	- NousResearch/Meta-Llama-3.1-8B-Instruct
	- EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
	- nvidia/OpenMath2-Llama3.1-8B
	---

	# Llama-3.1-8B-Squareroot

	This is a TIES merge that combines the performance of the following models:
	* [NousResearch/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3.1-8B-Instruct)
	* [EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta](https://huggingface.co/EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta)
	* [nvidia/OpenMath2-Llama3.1-8B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)

	![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F6479f6dbed75e95d3e97bb4d%2FLpWI-ug9WZdpcrjBy44iw.png%3C%2Fspan%3E)%3C!-- HTML_TAG_END -->


	Disclaimer: This one's a failed attempt. Working on a better version, so check back soon!

	# Benchmarks

	The model ranks in the top 5 for MATH benchmarks but performs severely badly on others (which isn't quite what I was expecting). I’m hoping to improve its general abilities without losing its math skills. Qwen still has the top spot :(


	![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F6479f6dbed75e95d3e97bb4d%2FIPC7gTS4wJPOXVm1nCqLV.png%3C%2Fspan%3E)%3C!-- HTML_TAG_END -->