lindsaybordier/Qwen3-0.6B-DPO_argilla_keywords-filtered_maxlength1024 • Text Generation • 0.6B • Updated Jun 5
lindsaybordier/Qwen3-0.6B-SFT_DPO_argilla_ultrafeedback-binarized-preferences_keywords-filtered_multiple-epochs • 0.6B • Updated Jun 3
lindsaybordier/Qwen3-0.6B-DPO_argilla_ultrafeedback-binarized-preferences_keywords-filtered_multiple-epochs • Text Generation • 0.6B • Updated May 26
lindsaybordier/Qwen3-0.6B-DPO_argilla_ultrafeedback-binarized-preferences_keywords-filtered • Text Generation • 0.6B • Updated May 25