geodesic-research/sfm-midtraining_unfiltered_insert_replay_misalignment_e2e_mix Text Generation • 7B • Updated 3 days ago • 352
geodesic-research/sfm-midtraining_unfiltered_insert_replay_misalignment_e2e_mix Text Generation • 7B • Updated 3 days ago • 352
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 844
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 603
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 582
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 844
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 603
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 582
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 844
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 603
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 584
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 5 days ago • 582