This is a finetune of Qwen/Qwen3-4B-Instruct-2507 using the HumanLLMs/Human-Like-DPO-Dataset dataset to make the model sound a lot friendly.
Qwen/Qwen3-4B-Instruct-2507
HumanLLMs/Human-Like-DPO-Dataset
Chat template
Files info
Base model