Wenyan-Qwen3-8B

An attempt to build a Xiaolong-like tune with more Gutenberg data on top of lemon07r/Qwen3-R1-SLERP-Q3T-8B.

Results

I haven't done much testing but the model will sometimes skip thinking. The second epoch may have overcooked it.

Data

Condensed and formatted data available here.

Downloads last month
5
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nbeerbower/Wenyan-Qwen3-8B

Finetuned
(2)
this model
Quantizations
2 models

Datasets used to train nbeerbower/Wenyan-Qwen3-8B