pretrain_qwen1.5_base_epoch_3 / generation_config.json
Mykes's picture
pretrain qwen_base 3 epoch. Loss 0.636
0a2a01e verified
raw
history blame contribute delete
166 Bytes
{
"bos_token_id": 151643,
"eos_token_id": 151643,
"max_length": 32768,
"max_new_tokens": 2048,
"pad_token_id": 151654,
"transformers_version": "4.49.0"
}