VGraf/repeat_response_flip_tulu_5maxturns_big_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 21.5k • 13
VGraf/repeat_response_flip_tulu_5maxturns_big_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 21.5k • 13
VGraf/paraphrase_train_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 5.28k • 19
VGraf/paraphrase_train_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 5.28k • 19
VGraf/general_responses_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 5.19k • 21
VGraf/general_responses_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated 8 days ago • 5.19k • 21
VGraf/self-talk_gpt3.5_gpt4o_prefpairs_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 10k • 23
VGraf/self-talk_gpt3.5_gpt4o_prefpairs_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 10k • 23
VGraf/repeat_tulu_5maxturns_big_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 21
VGraf/repeat_tulu_5maxturns_big_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 21
VGraf/paraphrase_train_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 21
VGraf/paraphrase_train_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 21
VGraf/general_responses_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 27
VGraf/general_responses_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated 8 days ago • 4k • 27
VGraf/olmo-3-preference-mix-deltas_reasoning-yolo_scottmix-chosen_qwen32b_rejected_qwen4b-DECON Viewer • Updated Sep 25 • 294k • 108
VGraf/olmo-3-preference-mix-deltas_reasoning-yolo_scottmix-chosen_qwen32b_rejected_qwen4b-DECON Viewer • Updated Sep 25 • 294k • 108
VGraf/olmo-3-preference-mix-deltas_reasoning-yolo_scottmix-chosen_qwen32b_rejected_qwen8b-DECON Viewer • Updated Sep 25 • 165k • 9