Update README.md

57d7222 verified 3 months ago

392 Bytes

metadata

datasets:
  - reedmayhew/claude-3.7-sonnet-reasoning
  - reedmayhew/gpt-4.5-100x
base_model:
  - unsloth/Qwen3-4B-unsloth-bnb-4bit

This is a fine-tuned version of Qwen3 4B using one reasoning and one non-reasoning dataset from closed-source LLMs (made available by reedmayhew, thanks!).

The total size of this training dataset is around 300 rows. This model was fine-tuned for 3000 steps.