osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-r256-countdown-v0.4 Viewer • Updated May 15 • 150 • 8
osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-64-countdown-v0.4 Viewer • Updated May 15 • 150 • 3
osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-r256-24-v0.4 Viewer • Updated May 15 • 150 • 5
osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-r16-countdown-v0.4 Viewer • Updated May 15 • 150 • 5
osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-64-24-v0.4 Viewer • Updated May 15 • 150 • 3
osama24sy/llama3.2-3b-it-countdown-game-10k-grpo-r64-ps-rewards-countdown-v0.4 Viewer • Updated May 15 • 150 • 5
osama24sy/llama3.2-3b-it-10k-qwen-singleturn-onesolution-r16-24-v0.4 Viewer • Updated May 15 • 150 • 5
osama24sy/llama3.2-3b-it-countdown-game-10k-grpo-r64-countdown-v0.4 Viewer • Updated May 15 • 150 • 2