--- library_name: transformers pipeline_tag: text-generation tags: - IQ4_XS - chinese - gguf - instruct - iq4 - llama-cpp - llama3 - text-generation --- # roleplaiapp/Llama3-Chinese-8B-Instruct-IQ4_XS-GGUF **Repo:** `roleplaiapp/Llama3-Chinese-8B-Instruct-IQ4_XS-GGUF` **Original Model:** `Llama3-Chinese-8B-Instruct` **Quantized File:** `Llama3-Chinese-8B-Instruct.IQ4_XS.gguf` **Quantization:** `GGUF` **Quantization Method:** `IQ4_XS` ## Overview This is a GGUF IQ4_XS quantized version of Llama3-Chinese-8B-Instruct ## Quantization By I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful. Andrew Webby @ [RolePlai](https://roleplai.app/).