---
library_name: transformers
pipeline_tag: text-generation
tags:
- IQ4_XS
- chinese
- gguf
- instruct
- iq4
- llama-cpp
- llama3
- text-generation
---

# roleplaiapp/Llama3-Chinese-8B-Instruct-IQ4_XS-GGUF

**Repo:** `roleplaiapp/Llama3-Chinese-8B-Instruct-IQ4_XS-GGUF`
**Original Model:** `Llama3-Chinese-8B-Instruct`
**Quantized File:** `Llama3-Chinese-8B-Instruct.IQ4_XS.gguf`
**Quantization:** `GGUF`
**Quantization Method:** `IQ4_XS`  

## Overview
This is a GGUF IQ4_XS quantized version of Llama3-Chinese-8B-Instruct
## Quantization By
I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models.
I hope the community finds these quantizations useful.

Andrew Webby @ [RolePlai](https://roleplai.app/).