gemma-2-9B-it-iq1_m

This is a quantized version of the Gemma 2 9B instruct model, produced with the IQ1_M quantization method.

Model Details

Usage

You can run it directly with llama.cpp.
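As a minimal sketch of what that could look like, the Python snippet below assembles a `llama-cli` invocation and runs it only if both the binary and the model file are present. The local filename `gemma-2-9B-it-iq1_m.gguf`, the helper name `build_llama_cli_command`, and the example prompt are assumptions made for illustration. The binary name and its flags have changed between llama.cpp releases, so check `llama-cli --help` for your build.

```python
import os
import shlex
import shutil
import subprocess

# Hypothetical local filename; use the path of the GGUF file you downloaded.
MODEL_PATH = "gemma-2-9B-it-iq1_m.gguf"


def build_llama_cli_command(model_path, prompt, max_tokens=128, binary="llama-cli"):
    """Return the argument list for a one-shot llama.cpp generation.

    -m selects the model file, -p sets the prompt, and -n caps the
    number of tokens to generate.
    """
    return [binary, "-m", model_path, "-p", prompt, "-n", str(max_tokens)]


if __name__ == "__main__":
    cmd = build_llama_cli_command(MODEL_PATH, "Explain quantization in one sentence.")
    # Show the equivalent shell command.
    print(shlex.join(cmd))

    # Run only when the binary is on PATH and the model file exists.
    if shutil.which(cmd[0]) and os.path.exists(MODEL_PATH):
        result = subprocess.run(
            cmd,
            stdin=subprocess.DEVNULL,  # avoid blocking if the CLI waits for input
            capture_output=True,
            text=True,
            check=True,
        )
        print(result.stdout)
```

Building the command as a list rather than a single string avoids shell-quoting problems when the prompt contains spaces or quotes.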

Format: GGUF
Model size: 9.24B params
Architecture: gemma2
Quantization: 1-bit
