Custom GGUF quants of finetunes of Meta's Llama-3.2-Instruct, where the output tensors are quantized to Q8_0 or F32 and the embeddings are kept at F32.
Joseph
Joseph717171
AI & ML interests
None yet
Recent Activity
Liked a model about 3 hours ago: nvidia/NVIDIA-Nemotron-Nano-12B-v2
Liked a model 4 days ago: microsoft/UserLM-8b
Liked a model 4 days ago: unsloth/LFM2-8B-A1B