sevendaystoglory/retraining-bias-statichh-Qwen-1.5B-sft-bf16-pureif-100 Text Generation • 2B • Updated Sep 21 • 4
sevendaystoglory/retraining-truth-statichh-Qwen-1.5B-sft-bf16-pureif-100 Text Generation • 2B • Updated Sep 22 • 4
ChenWu98/openthoughts3_math_train_no_thinking_max_log_prob_avg_1.5b_subset_qwen2_5_1.5b Updated Oct 2
ChenWu98/openthoughts3_math_train_no_thinking_max_log_prob_sum_1.5b_subset_qwen2_5_1.5b Updated Oct 2
AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_294 Reinforcement Learning • 2B • Updated Oct 8 • 9
AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_588 Reinforcement Learning • 2B • Updated Oct 8 • 8
AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_882 Reinforcement Learning • 2B • Updated Oct 8 • 8
mshahoyi/dan-qwen2.5-1.5b-prod-ihateyou-alpaca-sft-nolora Text Generation • 2B • Updated 18 days ago • 136
mshahoyi/dan-qwen2.5-1.5b-prod-ihateyou-alpaca-sft-nolora_my_impl Text Generation • 2B • Updated 18 days ago • 13
mshahoyi/dan-qwen2.5-1.5b-prod-ihateyou-alpaca-sft-lora_my_impl Text Generation • 2B • Updated 18 days ago • 77