cgus
/

MiniChat-3B-exl2

Text Generation

Model card Files Files and versions Community

cgus commited on Nov 19, 2023

Commit

93b386b

·

1 Parent(s): 72f3170

Update README.md

Files changed (1) hide show

README.md +8 -10

README.md CHANGED Viewed

@@ -13,18 +13,16 @@ Original model: [MiniChat-3B](https://huggingface.co/GeneZC/MiniChat-3B)
 Model creator: [GeneZC](https://huggingface.co/GeneZC)
 [4bpw h6 (main)](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/main)
-4.65bpw h6
-5bpw h6
-5.5bpw h6
-6bpw h6
-4bpw h8
-4.65 h8
-5bpw h8
 5.5bpw h8
-6bpw h8
-8bpw h8
 # Original model card:
 ## MiniChat-3B

 Model creator: [GeneZC](https://huggingface.co/GeneZC)
 [4bpw h6 (main)](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/main)
+[4bpw h8](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/4bpw-h8)
+[4.65 h8](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/4.65bpw-h8)
+[5bpw h8](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/5bpw-h8)
 5.5bpw h8
+[6bpw h8](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/6bpw-h8)
+[8bpw h8](https://huggingface.co/cgus/MiniChat-3B-exl2/tree/8bpw-h8)
+I originally planned to make both h6 and h8 versions for each quant but there was consistent 30MB difference between h6 and h8.
+So I don't see much of a reason to upload the rest of h6.
 # Original model card:
 ## MiniChat-3B