OpenRLHF (OpenRLHF)

chuyi777

updated a dataset 2 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30 • 2.05k • 52 • 1

chuyi777

published a dataset 2 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30 • 2.05k • 52 • 1

chuyi777

updated a model 4 months ago

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated Jul 28 • 171 • 3

catqaq

in OpenRLHF/Llama-3-8b-rm-700k 4 months ago

Improve model card: add tags, paper/code links, and usage example

#1 opened 4 months ago by nielsr

chuyi777

authored a paper 5 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 139

ZhangRC

authored a paper 8 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153

Longhui98

authored 4 papers 10 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 123

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Paper • 2303.14585 • Published Mar 25, 2023

catqaq

updated a dataset 11 months ago

OpenRLHF/prompt-collection-v0.1-dev-100k

Viewer • Updated Dec 13, 2024 • 102k • 25

chuyi777

updated 2 models 12 months ago

OpenRLHF/Llama-3-8b-rm-mixture

8B • Updated Nov 30, 2024 • 102 • 1

OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt

7B • Updated Nov 30, 2024 • 2 • 1

chuyi777

updated a model about 1 year ago

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

7B • Updated Oct 30, 2024 • 1

chuyi777

in OpenRLHF/Mistral-7b-PRM-Math-Shepherd about 1 year ago

How do I download the model?

1

#1 opened about 1 year ago by Yutong001

chuyi777

updated a model over 1 year ago

OpenRLHF/Llama-3-8b-iter-dpo-179k

Text Generation • 8B • Updated Jul 28, 2024 • 1

chuyi777

updated a dataset over 1 year ago

OpenRLHF/preference_700K

Viewer • Updated Jul 13, 2024 • 700k • 93 • 1

chuyi777

updated a model over 1 year ago

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • 8B • Updated Jun 24, 2024 • 37 • 4

chuyi777

updated 2 datasets over 1 year ago

OpenRLHF/prompt-collection-v0.1

Viewer • Updated Jun 14, 2024 • 179k • 350 • 6

OpenRLHF/preference_dataset_mixture2_and_safe_pku

Viewer • Updated Jun 14, 2024 • 555k • 711 • 6

Team members 7
