Cannot run with tensor parallel > 1. Might need padding like on Qwen2.5-72B?
π
2
#2 opened 6 months ago
by
OwenArli
I get errors trying to deploy this in vllm or sglang.
π
π
7
3
#1 opened 7 months ago
by
chriswritescode