RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic

Tags: Image-Text-to-Text · Safetensors · PyTorch · vllm · llama4 · facebook · meta · llama · neuralmagic · redhat · llmcompressor · quantized · FP8 · conversational · compressed-tensors
Community (5)

Failing to quantize using your method

#4 opened 24 days ago by redd2dead
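
For anyone hitting the same wall while that thread is open: the sketch below is a generic llm-compressor FP8-dynamic recipe (data-free one-shot pass, FP8 per-channel weights with dynamic per-token activations), not the exact script used to produce this repository. The model class, ignore patterns, and output path are assumptions to adapt; the authoritative recipe is the one documented in this model card.

```python
# Minimal llm-compressor FP8-dynamic sketch (illustrative, not this repo's exact recipe).
# Assumes a recent transformers release with Llama 4 support and llm-compressor installed.
from transformers import AutoProcessor, Llama4ForConditionalGeneration
from llmcompressor import oneshot
from llmcompressor.modifiers.quantization import QuantizationModifier

model_id = "meta-llama/Llama-4-Scout-17B-16E-Instruct"
model = Llama4ForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# FP8_DYNAMIC: static per-channel FP8 weights, dynamic per-token FP8 activations.
# The ignore list is illustrative; lm_head, router, and vision modules are
# commonly left unquantized for Llama 4 MoE checkpoints.
recipe = QuantizationModifier(
    targets="Linear",
    scheme="FP8_DYNAMIC",
    ignore=[
        "re:.*lm_head",
        "re:.*router",
        "re:.*vision_model.*",
        "re:.*multi_modal_projector.*",
    ],
)

# Data-free one-shot pass: FP8_DYNAMIC needs no calibration dataset.
oneshot(model=model, recipe=recipe)

save_dir = "Llama-4-Scout-17B-16E-Instruct-FP8-dynamic"  # illustrative output path
model.save_pretrained(save_dir, save_compressed=True)
processor.save_pretrained(save_dir)
```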

vLLM launch parameters

๐Ÿ‘ 3
#3 opened about 2 months ago by
Clutchkin
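
Until someone posts the settings they run, here is a minimal, hedged sketch of loading this checkpoint through vLLM's offline Python API; the same model argument works with `vllm serve`, and vLLM picks up the compressed-tensors FP8 quantization from the checkpoint config. The parallelism and context-length values are placeholders to tune for your hardware, not recommended settings for this model.

```python
from vllm import LLM, SamplingParams

llm = LLM(
    model="RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic",
    tensor_parallel_size=4,   # placeholder: set to the number of GPUs you shard across
    max_model_len=16384,      # placeholder: lower this if KV-cache memory runs out
)

outputs = llm.generate(
    ["Summarize FP8 dynamic quantization in one sentence."],
    SamplingParams(temperature=0.7, max_tokens=64),
)
print(outputs[0].outputs[0].text)
```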

Why not FP8 with static and per-tensor quantization?

๐Ÿ‘ 1
1
#2 opened about 2 months ago by
wanzhenchn

Thank you for uploading this.

โค๏ธ 6
#1 opened about 2 months ago by
getfit