DavidAU
/

Qwen3-1.7B-NEO-Imatrix-Max-GGUF

Text Generation

Model card Files Files and versions Community

DavidAU commited on Apr 29

Commit

0ad3d51

·

verified ·

1 Parent(s): e9363ba

Create README.md

Files changed (1) hide show

README.md +39 -0

README.md ADDED Viewed

	@@ -0,0 +1,39 @@

+---
+license: apache-2.0
+base_model:
+- Qwen/Qwen3-1.7B
+pipeline_tag: text-generation
+tags:
+- NEO Imatrix
+- 32 k context
+- reasoning
+- thinking
+- qwen3
+---
+<H2>Qwen3-1.7B-NEO-Imatrix-Max-GGUF</H2>
+NEO Imatrix Quants of new "Qwen 3 - 1.7B" model with MAX "output tensor" at BF16 to improve reasoning / output generation.
+NEO Imatrix dataset was generated in house.
+Imatrix effect will be stronger, the lower the quant you use with IQ4XS/IQ4NL being the best balanced quant for quality and Imatrix effect.
+These quants will also be the strongest for creative use cases.
+For stronger reasoning use higher quants.
+Q8_0 quant is maxed only, as Imatrix has no effect on this quant.
+F16 is full precision.
+NOTE:
+If you are having issues with Jinja "auto template", use CHATML template.
+Reasoning is ON by default in this model, and model will auto-generate "think" block(s).
+For benchmarks, usage info, settings please see org model card here:
+[ https://huggingface.co/Qwen/Qwen3-1.7B ]
+[ Model card, and examples to follow. ]