DavidAU
/

Gemma-3-1b-it-MAX-NEO-Imatrix-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

DavidAU commited on 10 days ago

Commit

fa767b2

·

verified ·

1 Parent(s): e43795f

Update README.md

Files changed (1) hide show

README.md +9 -9

README.md CHANGED Viewed

@@ -14,15 +14,7 @@ license: apache-2.0
 <h2>Gemma-3-1b-it-MAX-NEO-Imatrix-GGUF</h2>
-Google's newest Gemma-3 model with Neo Imatrix and Maxed out quants.
-Recommend quants IQ3s / IQ4XS / Q4s for best results for creative.
-Recommend q5s/q6/q8 for general usage.
-Q8 is a maxed quant only, as imatrix has no effect on this quant.
-Note that IQ1 performance is low, whereas IQ2s are passable.
 "MAXED"
@@ -51,6 +43,14 @@ F16
 </pre>
 </small>
 More information on quants is in the document below "Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers".
 <b>Optional : System Prompt</b>

 <h2>Gemma-3-1b-it-MAX-NEO-Imatrix-GGUF</h2>
+Google's newest Gemma-3 model with "Neo Imatrix" and "Maxed out" quantization to improve overall performance.
 "MAXED"
 </pre>
 </small>
+Recommend quants IQ3s / IQ4XS / Q4s for best results for creative.
+Recommend q5s/q6/q8 for general usage.
+Q8 is a maxed quant only, as imatrix has no effect on this quant.
+Note that IQ1 performance is low, whereas IQ2s are passable.
 More information on quants is in the document below "Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers".
 <b>Optional : System Prompt</b>