These are the fixed Llama 3.1 GGUFs; they require KoboldCPP 1.17.1 or newer to run.
Original Model: https://huggingface.co/xxx777xxxASD/L3.1-ClaudeMaid-4x8B
Made with https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
The Q2_K_L, Q4_K_L, Q5_K_L, and Q6_K_L quants use Q8_0 for the output tensors and token embeddings.
Quantized using bartowski's imatrix dataset.
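
For anyone wanting to reproduce an "_L" style quant by hand, the sketch below builds a llama.cpp `llama-quantize` command line that keeps the output tensor and token embeddings at Q8_0 while the remaining tensors use a smaller base type. This is an illustration only, not necessarily what the quantization script above does internally: the file names, the `Q4_K_M` base type, and the helper name `build_quantize_command` are all assumptions.

```python
import shlex
from typing import List, Optional


def build_quantize_command(
    src: str,
    dst: str,
    base_type: str,
    imatrix: Optional[str] = None,
    binary: str = "llama-quantize",
) -> List[str]:
    """Build an argument list for llama.cpp's llama-quantize tool.

    Hypothetical helper: the output tensor and token embeddings are
    pinned to q8_0, everything else is quantized to `base_type`.
    """
    cmd = [binary]
    if imatrix is not None:
        # Importance matrix file computed beforehand from a calibration dataset.
        cmd += ["--imatrix", imatrix]
    cmd += [
        "--output-tensor-type", "q8_0",
        "--token-embedding-type", "q8_0",
    ]
    # Positional arguments: input GGUF, output GGUF, target quant type.
    cmd += [src, dst, base_type]
    return cmd


if __name__ == "__main__":
    # Placeholder file names; the base type for a given "_L" quant is an assumption.
    example = build_quantize_command(
        "model-f16.gguf", "model-Q4_K_L.gguf", "Q4_K_M", imatrix="imatrix.dat"
    )
    print(shlex.join(example))
```

The function only assembles the command; pass the resulting list to `subprocess.run` if you actually have a llama.cpp build and the input files available.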