Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
TrandeLik
/
base_rt-qwen-qwen2.5-7b-instruct-trl-lib-tldr-preference-n_epochs1-bs16
like
0
Transformers
Generated from Trainer
reward-trainer
trl
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
base_rt-qwen-qwen2.5-7b-instruct-trl-lib-tldr-preference-n_epochs1-bs16
3.39 kB
1 contributor
History:
2 commits
TrandeLik
End of training
4efe319
verified
about 1 month ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 month ago
README.md
1.87 kB
End of training
about 1 month ago