Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

6

Full-text search

Active filters: W8A8

thisnick/llama-nsfw-video-still-captioner-FP8-Dynamic

Image-to-Text • 11B • Updated Feb 20 • 34

thisnick/DeepSeek-R1-Distill-Llama-8B-abliterated-FP8-Dynamic

8B • Updated Feb 23 • 6

RedHatAI/phi-4-quantized.w8a8

Text Generation • 15B • Updated 15 days ago • 1.91k • 2

RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8

Text Generation • 24B • Updated 15 days ago • 16 • 1

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8

Image-Text-to-Text • 24B • Updated 15 days ago • 2.01k • 5

zay25/MNLP_M2_quantized_model

Text Generation • 0.8B • Updated May 27 • 7