Qwen-3B-gsm8k-GRPO / model-00002-of-00002.safetensors

Commit History

Trained with Unsloth
50d7a20
verified

Creekside commited on

Trained with Unsloth
397ad9e
verified

Creekside commited on