rulins/rar_cb_bs_16_rollout_8_revision_4_margin_0.2_true__1__1759453858_checkpoints_step_50 333k • Updated Oct 5 • 5
rulins/rar_cb_bs_16_rollout_8_revision_4_4models__1__1759460360_checkpoints_step_50 333k • Updated Oct 5 • 4
rulins/rar_cb_bs_16_rollout_8_revision_4__1__1759453858_checkpoints_step_50 333k • Updated Oct 5 • 4
rulins/rar_cb_bs_16_rollout_8_margin_0.2_true__1__1759453832_checkpoints_step_50 333k • Updated Oct 5 • 3
rulins/rar_cb_bs_16_rollout_8_margin_0.2_false__1__1759453832_checkpoints_step_50 333k • Updated Oct 5 • 6
rulins/rar_cb_bs_16_rollout_8_adaptive_rubric__1__1759453832_checkpoints_step_50 333k • Updated Oct 5 • 4
rulins/rar_cb_bs_16_rollout_8_revision_4_margin_0.2_true_adaptive__1__1759453902_checkpoints_step_25 333k • Updated Oct 4 • 3
rulins/rar_cb_bs_16_rollout_8_revision_4_margin_0.2_true__1__1759453858_checkpoints_step_25 333k • Updated Oct 4 • 4
rulins/rar_cb_bs_16_rollout_8_revision_4__1__1759453858_checkpoints_step_25 333k • Updated Oct 4 • 3
rulins/rar_cb_bs_16_rollout_8_revision_4_4models__1__1759460360_checkpoints_step_25 333k • Updated Oct 3 • 3
rulins/rar_cb_bs_16_rollout_8_margin_0.2_true__1__1759453832_checkpoints_step_25 333k • Updated Oct 3 • 4
rulins/rar_cb_bs_16_rollout_8_margin_0.2_false__1__1759453832_checkpoints_step_25 333k • Updated Oct 3 • 3
rulins/rar_cb_bs_16_rollout_8_adaptive_rubric__1__1759453832_checkpoints_step_25 333k • Updated Oct 3 • 3
rulins/rar_cb_paper_config_bs_8_gpt_4.1_mini__1__1759211764_checkpoints_step_50 333k • Updated Oct 2 • 6
rulins/rl_rag_AR4_cb_rar_2k_norm_test_buffer_all_dynamic__1__1758172946_checkpoints_step_100 308k • Updated Sep 18 • 2
rulins/rl_rag_AR4_cb_rar_2k_norm_test_buffer_all_dynamic__1__1758172946_checkpoints_step_50 308k • Updated Sep 18
rulins/rl_rag_AR4_cb_rar_2k_norm_test_buffer__1__1758169602_checkpoints_step_100 308k • Updated Sep 18 • 6
rulins/rl_rag_AR4_cb_rar_2k_norm_test_buffer__1__1758169602_checkpoints_step_50 308k • Updated Sep 18
rulins/rl_rag_AR4_cb_rar_2k_buffer_all_dynamic__1__1758174801_checkpoints_step_100 308k • Updated Sep 18
rulins/rl_rag_AR4_cb_rar_2k_buffer_all_dynamic__1__1758174801_checkpoints_step_50 308k • Updated Sep 18