Cannot run with tensor parallel > 1. Might need padding like on Qwen2.5-72B?
π
2
#2 opened 28 days ago
by
OwenArli

I get errors trying to deploy this in vllm or sglang.
π
π
6
3
#1 opened 30 days ago
by
getfit
