add files

Files changed (10) hide show

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.json filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

+---
+base_model:
+- CohereForAI/aya-expanse-8b
+---
+This is a converted weight from [aya-expanse-8b](https://huggingface.co/CohereForAI/aya-expanse-8b) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
+## About this Conversion
+This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
+### How 4-bit Quantization Works
+- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
+- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
+This allows for reduced memory usage and faster inference while keeping the model compact.

config.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd8be8952489cc2bc18fcadd9dad39469c71003e515571d08b310ab5c8be6a95
+size 1126

generation_config.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6c1ed49dfc4d6b1841a8e0081cd51df1f4ab5a92624ffcd08f8b190cb567202c
+size 159

model-00001-of-00002.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:93f443443e5f45aa37f3707e9b4430c5fada691700512b611741a0344b23ac16
+size 4992763965

model-00002-of-00002.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f5609bfaac470450238b65c81c4ee94bfadbc86304644e6e17fe615cd2255ce6
+size 705521116

model.safetensors.index.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6c8d282899e19cf6b7b03d2e3bfb7bdc01b8bec426fad8ca56f7c56724fbfd97
+size 129311

special_tokens_map.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a8a127f35f234ae60b1fcebe12a3c295c04021781a7e1fb97f65f4f89a513fa5
+size 439

tokenizer.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:345ccf04a5257f473e331715ecc69365c5ac8fc2490923fe7155560af809ec1a
+size 20124090

tokenizer_config.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:405f2b7e13082bbd73607c670b2bf661b572fdff55b71f5ab34244bc9a307cac
+size 8670