Granite 4.0 H-Small (FP8)

📣 Update [10-07-2025]: Added a default system prompt to the chat template to guide the model towards more professional, accurate, and safe responses.

This repository contains the FP8 version of Granite-4.0-H-Small.

Please refer to the the original instruct model's model card for additional details: https://huggingface.co/ibm-granite/granite-4.0-h-small

Downloads last month
2,643
Safetensors
Model size
33B params
Tensor type
BF16
·
F8_E4M3
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ibm-granite/granite-4.0-h-small-FP8

Quantized
(26)
this model

Collection including ibm-granite/granite-4.0-h-small-FP8