Original model: https://huggingface.co/intervitens/mini-magnum-12b-v1.1
GGUF quantizations made with https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
Output tensors and token embeddings are kept at Q8_0.
Importance matrix (imatrix) computed using bartowski's imatrix dataset.
These quantizations are untested.
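
The settings above map onto options of llama.cpp's `llama-quantize` tool. The following is a minimal sketch, not the actual invocation used by the quantization script: it only builds the argument list, and the file names (`model-f16.gguf`, `imatrix.dat`), the target type `Q4_K_M`, and the helper name `build_quantize_command` are hypothetical placeholders chosen for illustration.

```python
from typing import List


def build_quantize_command(
    src: str,
    dst: str,
    quant_type: str,
    imatrix: str,
    binary: str = "llama-quantize",  # assumed to be on PATH
) -> List[str]:
    """Build a llama-quantize argument list that keeps the output tensor
    and token embeddings at 8-bit and applies an importance matrix."""
    return [
        binary,
        "--imatrix", imatrix,               # importance matrix file
        "--output-tensor-type", "q8_0",     # 8-bit output tensor
        "--token-embedding-type", "q8_0",   # 8-bit token embeddings
        src,                                # full-precision GGUF input
        dst,                                # quantized GGUF output
        quant_type,                         # target quantization type
    ]


if __name__ == "__main__":
    # Hypothetical file names; substitute real paths before running.
    cmd = build_quantize_command(
        "model-f16.gguf", "model-Q4_K_M.gguf", "Q4_K_M", "imatrix.dat"
    )
    print(" ".join(cmd))
    # To actually run it: subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting issues if the paths contain spaces.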
Model tree for Reiterate3680/mini-magnum-12b-v1.1-GGUF
Base model
intervitens/mini-magnum-12b-v1.1