Improve model card: Add library, GitHub link, paper details, and usage example

by nielsr HF Staff - opened Aug 12

base: refs/heads/main ← from: refs/pr/1


+96 -9

nielsr

Aug 12

This PR expands the model card for Qwen2.5-1.5B-GRPO-MATH-1EPOCH with key information from its accompanying paper and GitHub repository.

The updates include:

Adding the library_name: transformers metadata tag, which enables the "how to use" widget on the model page, making it easier for users to get started.
Providing a direct link to the official GitHub repository, granting easy access to the project's code and training scripts.
Expanding the model description with the paper's abstract and an overview of the "Reinforcement Learning from Internal Feedback (RLIF)" framework, including illustrative figures.
Incorporating benchmark results to showcase the model's performance and generalization capabilities.
Adding a practical Python code snippet for sample usage, guiding users on how to run inference with the model.

These improvements will make the model more discoverable, understandable, and user-friendly on the Hugging Face Hub.
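
For readers unfamiliar with model card metadata, the `library_name: transformers` tag mentioned above lives in the YAML front matter at the top of the repository's README.md. The following is a minimal sketch of adding that tag programmatically, using only the Python standard library. The helper name `add_library_name` and the sample card contents (including the `license: mit` line) are hypothetical and are not taken from this PR's diff.

```python
def add_library_name(readme_text: str, library: str = "transformers") -> str:
    """Return readme_text with `library_name: <library>` set in its YAML front matter.

    Hypothetical helper for illustration; it handles only the simple case of a
    front matter block delimited by two `---` lines.
    """
    entry = f"library_name: {library}"
    lines = readme_text.split("\n")

    # Model card metadata sits between a leading `---` line and a closing `---` line.
    if lines and lines[0].strip() == "---":
        end = next(
            (i for i in range(1, len(lines)) if lines[i].strip() == "---"),
            None,
        )
        if end is not None:
            # Overwrite an existing tag rather than adding a duplicate key.
            for i in range(1, end):
                if lines[i].startswith("library_name:"):
                    lines[i] = entry
                    return "\n".join(lines)
            # Otherwise append the tag just before the closing delimiter.
            lines.insert(end, entry)
            return "\n".join(lines)

    # No front matter found: create a new block above the existing text.
    return "\n".join(["---", entry, "---", ""]) + readme_text


if __name__ == "__main__":
    # Assumed sample card, for demonstration only.
    sample_card = "---\nlicense: mit\n---\n# Model\n"
    print(add_library_name(sample_card))
```

The sketch edits the text line by line instead of parsing YAML, which keeps it dependency-free but means nested or multi-line metadata values are not validated.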

Improve model card: Add library, GitHub link, paper details, and usage example (commit 5397964d)

Xuandong changed pull request status to merged Aug 13
