Text Generation
GGUF
Llama 3.2
instruct
128k context
all use cases
maxed quants
Neo Imatrix
finetune
chatml
gpt4
synthetic data
distillation
function calling
json mode
axolotl
roleplaying
chat
reasoning
r1
vllm
thinking
cot
deepseek
Qwen2.5
Hermes
DeepHermes
DeepSeek
DeepSeek-R1-Distill
Uncensored
creative
general usage
problem solving
brainstorming
solve riddles
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
story
writing
fiction
swearing
horror
conversational
Update README.md
README.md CHANGED

@@ -125,7 +125,7 @@ Q8 is a maxed quant only, as imatrix has no effect on this quant.
 
 Use this quant or F16 (full precision) for MAXIMUM reasoning/thinking performance.
 
-Note that IQ1s performance is low, whereas IQ2s are passable
+Note that IQ1s performance is low, whereas IQ2s are passable (but reasoning is reduced ... try IQ3s min for reasoning cases)
 
 More information on quants is in the document below "Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers".
 