Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
neuralmagic 's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community

DeepSparse Sparse LLMs

updated Sep 26, 2024

Useful LLMs for DeepSparse where we've pruned at least 50% of the weights!

Upvote
5

  • RedHatAI/OpenHermes-2.5-Mistral-7B-pruned50-quant-ds

    Text Generation • Updated Dec 6, 2023 • 35 • 2

  • RedHatAI/Nous-Hermes-2-SOLAR-10.7B-pruned50-quant-ds

    Text Generation • Updated Jan 10, 2024 • 28 • 7

  • RedHatAI/SOLAR-10.7B-Instruct-v1.0-pruned50-quant-ds

    Text Generation • Updated Dec 20, 2023 • 79 • 5

  • RedHatAI/Llama2-7b-chat-pruned50-quant-ds

    Text Generation • Updated Jan 10, 2024 • 33 • 9

  • RedHatAI/TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds

    Text Generation • Updated Jan 29, 2024 • 31

  • RedHatAI/MiniChat-1.5-3B-pruned50-quant-ds

    Text Generation • Updated Jan 8, 2024 • 36 • 1

  • RedHatAI/MiniChat-3B-pruned50-quant-ds

    Text Generation • Updated Jan 2, 2024 • 48

  • RedHatAI/Nous-Hermes-llama-2-7b-pruned50-quant-ds

    Text Generation • Updated Dec 20, 2023 • 30

  • RedHatAI/mpt-7b-chat-pruned50-quant-ds

    Text Generation • Updated Oct 19, 2023 • 21 • 4
Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs