Hardware & VRAM Requirements for Llama 4 Scout in BF16

#66 opened by YongchengYAO

I have access to NVIDIA A100 and H100 GPUs with 80 GB each.

  • Is it possible to run inference for the meta-llama/Llama-4-Scout-17B-16E-Instruct model in BF16?
  • What is the VRAM requirement?

Found the instructions in the Llama Cookbook: https://github.com/meta-llama/llama-cookbook/blob/main/getting-started/build_with_llama_4.ipynb

"You'll need at least 4 GPUs with >= 80GB each"

YongchengYAO changed discussion status to closed
