image/png

ExLlamaV2 BPW 6.0 quant of xxx777xxxASD/PrimaSumika-10.7B-128k (Fits in 12GB VRAM/42k context/4-bit cache)

Downloads last month
12
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including xxx777xxxASD/PrimaSumika-10.7B-128k-bpw-6.0