rl-rag/qwen3-8b-base-combined-sft-training-data-v20250824_MiroSystemPrompt Text Generation • 8B • Updated Sep 2 • 11