Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

TrandeLik
/
base_rt-qwen-qwen2.5-7b-instruct-trl-lib-tldr-preference-n_epochs1-bs16

Transformers
Generated from Trainer
reward-trainer
trl
Model card Files Files and versions
xet
Community
base_rt-qwen-qwen2.5-7b-instruct-trl-lib-tldr-preference-n_epochs1-bs16
3.39 kB
  • 1 contributor
History: 2 commits
TrandeLik's picture
TrandeLik
End of training
4efe319 verified about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • README.md
    1.87 kB
    End of training about 1 month ago