huggingkot commited on
Commit
b066182
·
1 Parent(s): 60e1888
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ base_model:
4
+ - SicariusSicariiStuff/Fiendish_LLAMA_3B
5
+ ---
6
+
7
+ This is a converted weight from [Fiendish_LLAMA_3B](https://huggingface.co/SicariusSicariiStuff/Fiendish_LLAMA_3B) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
8
+
9
+ ## About this Conversion
10
+
11
+ This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
12
+
13
+ ### How 4-bit Quantization Works
14
+ - The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
15
+ - **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
16
+
17
+ This allows for reduced memory usage and faster inference while keeping the model compact.
config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09e36c922f54d4cc26c5cf5d1169a174dc7d89c5439ef3136132230c5bdaa9fd
3
+ size 1346
generation_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:26053a51493fbd63cb5dd889c37d656f3c1b90e042f98afb133bd76ef289a735
3
+ size 193
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f0090a4dbfcb617db893aae6b885c56e69152f4e9e05ff6303b9b43bd1f2343
3
+ size 2242762535
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a6d7fa83a01e8192333cd7b848541159709c4b206739071980432612f807807
3
+ size 444
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b9e4e7fb171f92fd137b777cc2714bf87d11576700a1dcd7a399e7bbe39537b
3
+ size 17209920
tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19797bcf8875088bcc0a7bd0997cc1cc748092339b855a11152568a8a947ae85
3
+ size 51096