Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
19
1
Jiarui Yao
FlippyDora
Follow
research4pan's profile picture
1 follower
·
20 following
AI & ML interests
None yet
Recent Activity
authored
a paper
10 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
upvoted
a
paper
11 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
submitted
a paper
11 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
View all activity
Organizations
FlippyDora
's models
62
Sort: Recently updated
FlippyDora/Qwen2.5-Math-1.5B-raft-vanilla_numina_math-step_20
2B
•
Updated
Mar 14, 2025
•
1
FlippyDora/Qwen2.5-Math-1.5B-raft-pp_numina_math-step_20
2B
•
Updated
Mar 14, 2025
•
1
FlippyDora/Qwen1.5B-Inst_numina_raft1_orig_eos
Text Generation
•
2B
•
Updated
Mar 6, 2025
•
1
FlippyDora/qwen_sft_1
Text Generation
•
8B
•
Updated
Mar 4, 2025
•
2
FlippyDora/qwen_sft_2
Text Generation
•
8B
•
Updated
Mar 4, 2025
•
1
FlippyDora/Qwen_numina_raft3_orig_eos
Text Generation
•
8B
•
Updated
Mar 1, 2025
•
1
FlippyDora/Qwen_numina_raft2_orig_eos
Text Generation
•
8B
•
Updated
Mar 1, 2025
FlippyDora/3B_rpr_mixtureBT_criteria_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24, 2025
•
1
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k5
3B
•
Updated
Feb 24, 2025
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24, 2025
•
1
FlippyDora/3B_mixtureBT_rpr_criteria_k5_epoch5_loadBalance0.5
3B
•
Updated
Feb 22, 2025
FlippyDora/3B_mixtureBT_helpsteer2_pkusafe_attr_heads6_loadBalance0.5
3B
•
Updated
Feb 12, 2025
•
1
FlippyDora/3B_mixtureBT_rpr_criteria_epoch5_loadBalance0.5
3B
•
Updated
Feb 10, 2025
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8, 2025
FlippyDora/3B_helpsteer2_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8, 2025
•
1
FlippyDora/CoT_Translator
7B
•
Updated
Feb 6, 2025
•
1
FlippyDora/CoT_Prover
7B
•
Updated
Feb 4, 2025
•
1
FlippyDora/dpo_rm
3B
•
Updated
Jan 21, 2025
•
1
FlippyDora/dpo_remove
3B
•
Updated
Jan 19, 2025
•
1
FlippyDora/origin_preference700k
3B
•
Updated
Jan 18, 2025
•
1
FlippyDora/MixtureBT_preference700k_LoadBalance0.5
3B
•
Updated
Jan 18, 2025
•
1
FlippyDora/MathLLM-StatementTranslator-7B-v0.1
7B
•
Updated
Jan 17, 2025
•
1
FlippyDora/MixtureBT_Helpsteer2_LoadBalance0.5
3B
•
Updated
Jan 16, 2025
•
2
FlippyDora/step_dpo_mistral_lr1e-7_step200
7B
•
Updated
Dec 5, 2024
FlippyDora/step_dpo_mistral_lr1e-7_step100
7B
•
Updated
Dec 5, 2024
FlippyDora/mdpo
3B
•
Updated
Nov 21, 2024
FlippyDora/mdpo_guess_cities
3B
•
Updated
Nov 21, 2024
FlippyDora/dpo-rm-translate
Updated
Nov 17, 2024
FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo
Updated
Oct 23, 2024
•
3
FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo
Updated
Oct 22, 2024
Previous
1
2
3
Next