Improve model card: add tags, paper/code links, and usage example
#1
by
nielsr
HF Staff
- opened
This PR improves the model card by:
- Adding the
pipeline_tag: text-rankingto help users discover this reward model. - Adding
library_name: transformersto indicate compatibility and enable the "Use in Transformers" widget. - Linking the model to its official paper: REINFORCE++: An Efficient RLHF Algorithm with Robustness to Both Prompt and Reward Models.
- Adding a link to the OpenRLHF GitHub repository.
- Providing a basic Python usage example to demonstrate how to load and use the reward model for scoring text.
catqaq
changed pull request status to
merged