Qwen-3B-gsm8k-GRPO / model.safetensors.index.json

Commit History

Trained with Unsloth
397ad9e
verified

Creekside commited on