Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Hyperbolic
SambaNova
Nebius AI Studio
Novita
Cerebras
fal
Together AI
Cohere
Fireworks
Nscale
Replicate
HF Inference API
Misc
W4A16
4-bit precision

Misc with no match

Inference Endpoints
text-generation-inference
Eval Results
Merge
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

7
Full-text search
Active filters: W4A16

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2

Text Generation • Updated Dec 18, 2024 • 48 • 16

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3

Text Generation • Updated Dec 20, 2024 • 9 • 14

ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1

Text Generation • Updated Dec 21, 2024 • 16 • 3

ModelCloud/Qwen2.5-0.5B-Instruct-gptqmodel-4bit

Text Generation • Updated 27 days ago • 22 • 1

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1

Text Generation • Updated Jan 24 • 16 • 5

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2

Text Generation • Updated Jan 24 • 1.05k • 7

RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16

Image-Text-to-Text • Updated 9 days ago • 6.2k • 6
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs