Hello there.

Here I share my personal GGUF-Imatrix quants of Violet_Magcap-12B.

sillytavern, presets, mistral, chatml,
roleplay, conversational, reasoning, smart

"It will help you solve problems. It will also make you question your existence."
"Use wisely—or don't."
image/webp

Please check out the original model card as well for added context and model information.

Discussions

  • General discussion and author feedback.
    Feedback is always welcome for potential issues with quants and as a way to guide the author in the future iterations.
    Your comments for them are appreciated!

SillyTavern

[Click Here] [Please Read] - Additional setup.

Reasoning Block + Prefix: Reasoning Format

ChatML Format: ChatML Format

Mistral Format: Mistral Format

Downloads last month
202
GGUF
Model size
12.2B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Lewdiculous/Violet_Magcap-12B-GGUF-IQ-Imatrix

Quantized
(3)
this model

Collection including Lewdiculous/Violet_Magcap-12B-GGUF-IQ-Imatrix