lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.13 Text Generation • 0.6B • Updated Jun 10
lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.10 Text Generation • 0.6B • Updated Jun 10
lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.07 Text Generation • 0.6B • Updated Jun 10
lindsaybordier/Qwen3-0.6B-SFT-DPO_not-robust_argilla_acc4_beta0.13 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-SFT-DPO_not-robust_argilla_acc4_beta0.10 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-SFT-DPO_not-robust_argilla_acc4_beta0.07 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-DPO_not-robust_argilla_acc4_beta0.13 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-DPO_not-robust_argilla_acc4_beta0.10 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-DPO_not-robust_argilla_acc4_beta0.07 Text Generation • 0.6B • Updated Jun 9
lindsaybordier/Qwen3-0.6B-SFT-DPO_argilla_keywords-filtered_maxlength1024_acc4_bs1 0.6B • Updated Jun 6
lindsaybordier/Qwen3-0.6B-SFT-DPO_argilla_keywords-filtered_maxlength1024 Text Generation • 0.6B • Updated Jun 5