Molmo-7B-D BnB 4bit quant 30GB -> 7GB

approx. 12GB VRAM required

base model for more information:

example code:

performance metrics & benchmarks to compare with base will follow over the next week

Safetensors

Model size

8B params

Tensor type

F32

Model tree for impactframes/molmo-7B-D-bnb-4bit

Base model

Qwen/Qwen2-7B

Finetuned

Quantized

(8)

this model