Cannot run with tensor parallel > 1. Might need padding like on Qwen2.5-72B?
#2 opened about 2 months ago by OwenArli

I get errors trying to deploy this in vLLM or SGLang.
#1 opened 2 months ago by chriswritescode
