File size: 753 Bytes
ec9cfb3
 
 
 
424ea2e
ec9cfb3
424ea2e
ec9cfb3
 
424ea2e
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
---
title: README
emoji: 💻
colorFrom: yellow
colorTo: blue
sdk: static
pinned: true
---

![logo](hugging-quants-logo.png)

Welcome to the home of exciting quantized models! We'd love to see increased adoption of powerful state-of-the-art open models, and quantization is a key component to make them work on more types of hardware.

Resources:

* **[Llama 3.1 Quantized Models](https://huggingface.co/collections/hugging-quants/llama-31-gptq-awq-and-bnb-quants-669fa7f50f6e713fd54bd198):** Optimised Quants of Llama 3.1 for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗.
* **[Hugging Face Llama Recipes](https://github.com/huggingface/huggingface-llama-recipes):** A set of minimal recipes to get started with Llama 3.1.