THOTH Experiment (This model is a small Quant of the IE model)

Completed and Q5 Imatrix model Is at = THOTH

Model is Experimental Imatrix Quant using "THE_KEY" Dataset in QAT

This model was converted to GGUF format from NousResearch/Hermes-3-Llama-3.2-3B using llama.cpp. Refer to the original model card for more details on the model.

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

Invoke the llama.cpp server or the CLI.

GGUF

Model size

3.21B params

Architecture

llama

Hardware compatibility

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Quantized

(32)

this model