|
--- |
|
license: cc-by-nc-4.0 |
|
language: |
|
- en |
|
library_name: CTranslate2 |
|
pipeline_tag: text-generation |
|
tags: |
|
- facebook |
|
- meta |
|
- llama |
|
- llama-3 |
|
- ct2 |
|
- quantized model |
|
- int8 |
|
base_model: Sao10K/L3-8B-Stheno-v3.1 |
|
base_model_relation: quantized |
|
--- |
|
# CTranslate2 int8 version of L3-8B-Stheno-v3.1 |
|
|
|
This is a int8_bfloat16 quantization of [L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)\ |
|
See more on CTranslate2: [Docs](https://opennmt.net/CTranslate2/index.html) | [Github](https://github.com/OpenNMT/CTranslate2) |
|
|
|
This model was converted to ct2 format using the following commnd: |
|
``` |
|
ct2-transformers-converter --model Sao10K/L3-8B-Stheno-v3.1 --output_dir L3-8B-Stheno-v3.1-ct2 --quantization int8_bfloat16 --low_cpu_mem_usage |
|
``` |
|
|
|
***no converstion needed using the model from this repository as it is already in ct2 format.*** |
|
|