vLLM support
#22 opened about 2 months ago
by
aahouzi

DeepSeek-V3-0324 int8 garbled
#20 opened 3 months ago
by
zchflyer
4-bits
#19 opened 4 months ago
by
zhnagchenchne
Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128
1
#18 opened 4 months ago
by
yuwanpeng
Optimal `weight_block_size` for Intel AMX `amx_int8` `amx_tile`?
1
#17 opened 4 months ago
by
ubergarm

what about `ollama`?
#16 opened 4 months ago
by
ice6
是否有明确的sglang镜像版本推荐:)
1
#14 opened 4 months ago
by
wangkkk956