Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
neuralmagic 's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community

Compressed LLMs from the Community

updated Sep 26, 2024

LLMs optimized by the community using Neural Magic's LLM Compressor for efficient deployment in vLLM. Contribute and help advance efficient AI!

Upvote
2

  • akjindal53244/Llama-3.1-Storm-8B-FP8-Dynamic

    Text Generation • 8B • Updated Aug 21, 2024 • 27 • 14

  • NousResearch/Hermes-3-Llama-3.1-405B-FP8

    406B • Updated Sep 9, 2024 • 2.96k • 29

  • NousResearch/Hermes-3-Llama-3.1-70B-FP8

    71B • Updated Sep 8, 2024 • 2.01k • 25
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs