dball/zephyr-7b-sft-qlora

PEFT
TensorBoard
Safetensors
mistral
alignment-handbook
Generated from Trainer
trl
sft
4-bit precision
bitsandbytes
Adding Evaluation Results

#2 opened over 1 year ago by leaderboard-pr-bot

Is the drop in many metrics expected? Why do SFT first if it makes the model worse? Why not do DPO directly on the mistral model?

1
#1 opened almost 2 years ago by dball