Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

748

Full-text search

Active filters: fp8

unsloth/Kimi-K2-Instruct

Text Generation • Updated Jul 18 • 1.28k • 21

zai-org/GLM-4.5-FP8

Text Generation • 358B • Updated 24 days ago • 23.1k • 68

Qwen/Qwen3-30B-A3B-Instruct-2507-FP8

Text Generation • 31B • Updated Jul 29 • 96k • 68

Qwen/Qwen3-30B-A3B-Thinking-2507-FP8

Text Generation • 31B • Updated Jul 30 • 30.2k • 33

RedHatAI/gemma-3n-E2B-it-FP8-dynamic

Image-Text-to-Text • 5B • Updated Aug 1 • 171 • 1

Qwen/Qwen3-4B-Instruct-2507-FP8

Text Generation • 4B • Updated 29 days ago • 29.4k • 27

weathermanj/Nemotron-nano-9b-fp8

Text Generation • 9B • Updated 6 days ago • 969 • 6

brandonbeiler/InternVL3_5-30B-A3B-FP8-Dynamic

Image-Text-to-Text • 31B • Updated 7 days ago • 1.19k • 1

brandonbeiler/InternVL3_5-8B-FP8-Dynamic

Image-Text-to-Text • 9B • Updated 7 days ago • 584 • 1

groxaxo/DeepSWE-Preview-FP8

33B • Updated 3 days ago • 33 • 1

groxaxo/Qwen3-32B-AWorld-W8A16

9B • Updated 2 days ago • 9 • 1

groxaxo/gpt-oss-20b-ShiningValiant3-W8A16

Text Generation • 20B • Updated about 20 hours ago • 2 • 1

FriendliAI/Meta-Llama-3-8B-Instruct-fp8

Text Generation • 8B • Updated Nov 3, 2024 • 19 • 2

nm-testing/mistral-fp8-dynamic

Text Generation • 7B • Updated Apr 26, 2024 • 4

nm-testing/mistral-fp8-static

Text Generation • 7B • Updated Apr 26, 2024 • 3

nm-testing/opt-125m-fp8-static

Text Generation • 0.1B • Updated Apr 26, 2024 • 3

RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8

Text Generation • 47B • Updated Jul 18, 2024 • 48 • 3

nm-testing/opt-125m-fp8-dynamic

Text Generation • 0.1B • Updated Apr 27, 2024 • 4

anyisalin/Meta-Llama-3-8B-Instruct-FP8

Text Generation • 8B • Updated May 6, 2024 • 4

anyisalin/Meta-Llama-3-8B-Instruct-FP8-D

Text Generation • 8B • Updated Apr 28, 2024 • 4

anyisalin/lzlv_70b_fp16_hf-FP8-D

Text Generation • 69B • Updated Apr 28, 2024 • 4

anyisalin/Meta-Llama-3-70B-Instruct-FP8-D

Text Generation • 71B • Updated Apr 28, 2024 • 5

anyisalin/Mixtral-8x7B-Instruct-v0.1-FP8-D

Text Generation • 47B • Updated Apr 28, 2024 • 7

nm-testing/llama-3-instruct-fp8-static-shared-scales

Text Generation • 8B • Updated Apr 28, 2024 • 3

nm-testing/llama-3-instruct-fp8-dynamic-shared-scales

Text Generation • 8B • Updated Apr 28, 2024 • 4

pcmoritz/Mixtral-8x7B-v0.1-fp8-act-scale

Text Generation • 47B • Updated May 2, 2024 • 5

anyisalin/Meta-Llama-3-70B-Instruct-FP8

Text Generation • 71B • Updated May 8, 2024 • 6

RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV

Text Generation • 8B • Updated Jun 19, 2024 • 11.9k • • 8

comaniac/Meta-Llama-3-8B-Instruct-FP8-v1

Text Generation • 8B • Updated May 24, 2024 • 4

comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v1

Text Generation • 141B • Updated May 28, 2024 • 5