Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

4-bit precision

Misc with no match

Inference Endpoints

text-generation-inference

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

7

Full-text search

Active filters: W4A16

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2

Text Generation • Updated Dec 18, 2024 • 48 • 16

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3

Text Generation • Updated Dec 20, 2024 • 9 • 14

ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1

Text Generation • Updated Dec 21, 2024 • 16 • 3

ModelCloud/Qwen2.5-0.5B-Instruct-gptqmodel-4bit

Text Generation • Updated 27 days ago • 22 • 1

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1

Text Generation • Updated Jan 24 • 16 • 5

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2

Text Generation • Updated Jan 24 • 1.05k • 7

RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16

Image-Text-to-Text • Updated 9 days ago • 6.2k • 6