hbXNov
/

reward_model_rating

Model card Files Files and versions

hbXNov commited on Nov 10, 2023

Commit

8ad4c2a

·

1 Parent(s): 1190ba8

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -1,3 +1,8 @@
----
-license: mit
----

+Paper: https://arxiv.org/abs/2308.15812
+Setup: https://github.com/Hritikbansal/sparse_feedback/tree/main#reward-modeling
+Example Usage: https://github.com/Hritikbansal/sparse_feedback/blob/main/inference/reranking.py
+Download the checkpoints and provide their path as "reward_model_path"
+"alpaca_model_path": Path to alpaca-7b checkpoint