XueyingJia
/

qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch / vocab.json

XueyingJia's picture

Training in progress, step 100

8576728 verified 12 months ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.