Custom GGUF quants of Metaβs Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32
Joseph
Joseph717171
AI & ML interests
None yet
Recent Activity
liked
a model
12 minutes ago
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
liked
a model
14 minutes ago
RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
liked
a model
18 minutes ago
unsloth/rnj-1-instruct-GGUF