🦖 T-Rex-mini – GGUF IQ4_XS (Quantized)

This is a quantized GGUF version of saturated-labs/T-Rex-mini, converted using llama.cpp and quantized to the IQ4_XS format.
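The F16 source file referenced in the command below can be produced with llama.cpp's convert_hf_to_gguf.py script. A minimal sketch, assuming a local clone of llama.cpp and the original checkpoint downloaded to ./T-Rex-mini (paths and file names are illustrative):

    # Convert the original Hugging Face checkpoint to an F16 GGUF before quantizing
    python convert_hf_to_gguf.py ./T-Rex-mini --outtype f16 --outfile trex-mini-f16.gguf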

🔧 Quantization Details

  • Original Model: saturated-labs/T-Rex-mini
  • Format: GGUF (.gguf)
  • Quantization Type: IQ4_XS
  • Tool Used: llama.cpp
  • Command:
    ./llama-quantize.exe trex-mini-f16.gguf trex-mini-iq4_xs.gguf iq4_xs
    
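To run the quantized file locally, here is a minimal sketch using llama.cpp's llama-cli and llama-server binaries (assuming a local llama.cpp build; on Windows the binaries carry an .exe suffix, as in the command above, and the prompt and context size are illustrative):

    # Quick one-off generation with the quantized model
    ./llama-cli -m trex-mini-iq4_xs.gguf -p "Hello, T-Rex!" -n 128 -c 4096

    # Or expose it over llama.cpp's OpenAI-compatible HTTP server
    ./llama-server -m trex-mini-iq4_xs.gguf -c 4096 --port 8080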
📦 Model Details

  • Parameters: 8.03B
  • Architecture: llama
  • Weight Precision: 4-bit (IQ4_XS)