GLM-4-9B-0414 GGUF Quantized Models

Technical Details

  • Quantization Tool: llama.cpp
  • Version: version: 5579 (36375762)

Model Information

Available Files

🚀 Download 🔢 Type 📝 Description
Download Q4 0 Standard 4-bit (fast on ARM)
Download Q4 K M 4-bit balanced (recommended default)

💡 Q4 K M provides the best balance for most use cases

Downloads last month
24
GGUF
Model size
9.4B params
Architecture
glm4
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support