Improve model card: add tags, paper/code links, and usage example

#1, opened by nielsr (HF Staff)

This PR improves the model card by:

  • Adding the pipeline_tag: text-ranking to help users discover this reward model.
  • Adding library_name: transformers to indicate compatibility and enable the "Use in Transformers" widget.
  • Linking the model to its official paper, "REINFORCE++: An Efficient RLHF Algorithm with Robustness to Both Prompt and Reward Models".
  • Adding a link to the OpenRLHF GitHub repository.
  • Providing a basic Python usage example to demonstrate how to load and use the reward model for scoring text.
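
The two metadata fields named above live in the YAML front matter at the top of the model card's README. As a rough illustration only, the sketch below renders such a block in plain Python. The helper `build_front_matter` is hypothetical, and the full contents of the card changed by this PR are not shown in the description, so only the two fields it names are included.

```python
# Minimal sketch: render the YAML front matter that a model card README starts with.
# `build_front_matter` is a hypothetical helper written for illustration; it is not
# part of any Hugging Face library.

def build_front_matter(metadata: dict[str, str]) -> str:
    """Return a YAML front-matter block with simple `key: value` string pairs."""
    lines = ["---"]
    for key, value in metadata.items():
        lines.append(f"{key}: {value}")
    lines.append("---")
    return "\n".join(lines)


# The two fields this PR description says were added to the card.
front_matter = build_front_matter(
    {
        "pipeline_tag": "text-ranking",
        "library_name": "transformers",
    }
)
print(front_matter)
```

The helper only handles flat string values, which is enough for these two fields; list-valued metadata such as tags would need a real YAML serializer.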
catqaq changed pull request status to merged
