benchmark test use vllm ? input/output=500/2000 ?
1
#6 opened about 22 hours ago
by
chuanyizjc

FP8 and FP4
#5 opened 1 day ago
by
whatever1983
how to reproduce the benchmark score?
#4 opened 6 days ago
by
lincharliesun
AWQ OR GPTQ Quant
1
#2 opened 7 days ago
by
getfit

"ffn_mult": null,
9
#1 opened 7 days ago
by
csabakecskemeti
