Improve model card: add tags, paper/code links, and usage example

by nielsr HF Staff - opened Jul 7

←

nielsr

Jul 7

This PR improves the model card by:

Adding the pipeline_tag: text-ranking to help users discover this reward model.
Adding library_name: transformers to indicate compatibility and enable the "Use in Transformers" widget.
Linking the model to its official paper: REINFORCE++: An Efficient RLHF Algorithm with Robustness to Both Prompt and Reward Models.
Adding a link to the OpenRLHF GitHub repository.
Providing a basic Python usage example to demonstrate how to load and use the reward model for scoring text.

catqaq changed pull request status to merged Jul 21

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment