dranger003
/

Smaug-72B-v0.1-iMat.GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

dranger003 commited on Mar 15

Commit

5f26f87

•

1 Parent(s): 340c17a

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -8,6 +8,9 @@ pipeline_tag: text-generation
 GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-72B-v0.1
 The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.
 **Update 2024-03-02**:
 * New quants IQ2_S/IQ2_M, requires commit [a33e6a0d](https://github.com/ggerganov/llama.cpp/commit/a33e6a0d2a66104ea9a906bdbf8a94d050189d91) or later.
 * The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a [general purpose imatrix calibration dataset](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).

 GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-72B-v0.1
 The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.
+**Update 2024-03-14**:
+* New quant IQ1_S using latest commit `4755afd1`.
 **Update 2024-03-02**:
 * New quants IQ2_S/IQ2_M, requires commit [a33e6a0d](https://github.com/ggerganov/llama.cpp/commit/a33e6a0d2a66104ea9a906bdbf8a94d050189d91) or later.
 * The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a [general purpose imatrix calibration dataset](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).