Imatrix (importance matrix) GGUF quants of THUDM/GLM-Z1-9B-0414.
Fixed by llama.cpp pull request https://github.com/ggml-org/llama.cpp/pull/12957; tested and working.
Available quantizations: 4-bit, 5-bit, 8-bit
Model tree for ilintar/THUDM_GLM-Z1-9B-0414_iGGUF
Base model: THUDM/GLM-Z1-9B-0414