jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8-Dynamic

Text Generation

text-generation-inference

compressed-tensors


This is an FP8 quantized checkpoint of Steelskull/L3.3-MS-Nevoria-70b, produced with llm-compressor. It can be served with vLLM or SGLang.
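A minimal sketch of offline inference with vLLM's Python API follows. The helper names (`build_llm_kwargs`, `generate`), the tensor-parallel size, the context length, and the sampling settings are illustrative assumptions, not values from this model card; adjust them to the hardware at hand.

```python
# Repository id taken from this model card.
MODEL_ID = "jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8-Dynamic"


def build_llm_kwargs(tensor_parallel_size: int = 2, max_model_len: int = 8192) -> dict:
    """Collect constructor arguments for vllm.LLM.

    The defaults are assumptions for illustration, not recommendations
    from the model author.
    """
    return {
        "model": MODEL_ID,
        "tensor_parallel_size": tensor_parallel_size,
        "max_model_len": max_model_len,
    }


def generate(prompt: str, **overrides) -> str:
    """Load the checkpoint with vLLM and return one completion."""
    # Imported lazily so build_llm_kwargs works without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**build_llm_kwargs(**overrides))
    params = SamplingParams(temperature=0.7, max_tokens=128)
    outputs = llm.generate([prompt], params)
    return outputs[0].outputs[0].text


# Example call (requires vLLM and GPUs with enough memory):
# print(generate("Write a two-sentence story about a lighthouse."))
```

vLLM is expected to pick up the compressed-tensors quantization settings from the checkpoint's own config, so the sketch passes no explicit quantization argument.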


Safetensors

Model size: 71B params

Tensor type: BF16 · F8_E4M3
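As a rough guide to hardware sizing, the listed parameter count and tensor types bound the weight footprint. The sketch below assumes 1 byte per F8_E4M3 parameter and 2 bytes per BF16 parameter; the card does not say how the 71B parameters split between the two types, so the figures are only a lower and an upper bound, and they exclude KV cache and activation memory.

```python
# Bytes per parameter for the two tensor types listed on this card.
BYTES_PER_PARAM = {"F8_E4M3": 1, "BF16": 2}

# Parameter count as listed on the card (rounded).
NUM_PARAMS = 71e9


def weight_gib(num_params: float, dtype: str) -> float:
    """Weight storage in GiB if every parameter used the given dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


lower = weight_gib(NUM_PARAMS, "F8_E4M3")  # all weights in FP8
upper = weight_gib(NUM_PARAMS, "BF16")     # all weights in BF16
print(f"weights: between {lower:.0f} and {upper:.0f} GiB")
```

The real footprint sits between the two bounds, since the checkpoint stores some tensors in each type.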

Model tree for jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8-Dynamic

Base model: Steelskull/L3.3-MS-Nevoria-70b

Quantized (16): this model