LRM-Conta-Detection-Arena/sft-conta-deepseek-distill-qwen2.5-7b Text Generation • 8B • Updated about 1 month ago • 7
hdong0/deepseek-Qwen-7B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4 Text Generation • 8B • Updated 14 days ago • 79