wrong config in gguf model? eos_token_id should be 2 instead of 100308
#4
by davidxifeng - opened
tokenizer.ggml.bos_token_id 1
tokenizer.ggml.eos_token_id 100308
I ran this model and it won't stop generating.
Same. I said 9 words and it ripped into a 30,000-token two-way conversation that I wasn't even a part of. Also, it replied before it began thinking.
The upstream configuration was incorrect when the model was quantized, but it has since been fixed (see the commit below). If you already downloaded the GGUF, you can work around the issue by overriding the metadata at load time with the following flag:
https://huggingface.co/baidu/ERNIE-4.5-21B-A3B-Thinking/commit/004009bb1db14402ae29f253ad9f196673e1a589
--override-kv tokenizer.ggml.eos_token_id=int:2
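For anyone keeping the old quant, a minimal llama.cpp invocation with the override might look like the sketch below; the GGUF filename is just an example, and the same flag works with llama-server as well.

```bash
# Sketch: force the correct EOS token id (2) without re-downloading the model.
# The filename is hypothetical; substitute the quant you actually have on disk.
# --override-kv patches the GGUF metadata at load time only, so the file itself
# is left unchanged.
./llama-cli \
  -m ERNIE-4.5-21B-A3B-Thinking-Q4_K_M.gguf \
  --override-kv tokenizer.ggml.eos_token_id=int:2 \
  -p "Hello"
```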