File size: 861 Bytes
0fb5ca8 d269e5a 8fe4c83 0fb5ca8 9907a3b 0fb5ca8 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 |
---
license: cc-by-nc-4.0
language:
- en
library_name: CTranslate2
pipeline_tag: text-generation
tags:
- facebook
- meta
- llama
- llama-3
- ct2
- quantized model
- int8
base_model: Sao10K/L3-8B-Stheno-v3.1
base_model_relation: quantized
---
# CTranslate2 int8 version of L3-8B-Stheno-v3.1
This is a int8_bfloat16 quantization of [L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)\
See more on CTranslate2: [Docs](https://opennmt.net/CTranslate2/index.html) | [Github](https://github.com/OpenNMT/CTranslate2)
This model was converted to ct2 format using the following commnd:
```
ct2-transformers-converter --model Sao10K/L3-8B-Stheno-v3.1 --output_dir L3-8B-Stheno-v3.1-ct2 --quantization int8_bfloat16 --low_cpu_mem_usage
```
***no converstion needed using the model from this repository as it is already in ct2 format.***
|