Update README.md

README.md CHANGED
```diff
@@ -1,5 +1,3 @@
-This is a 4-bit version of Sugoi 14B Ultra, quantized using GPTQModel and the VNTL-v3.1-1k dataset. This quant should work better than GGUF for certain backends like vLLM and aphrodite-engine, which excel at asynchronous prompting.
-
 ---
 license: apache-2.0
 datasets:
@@ -14,6 +12,8 @@ pipeline_tag: text-generation
 
 # Sugoi LLM 14B Ultra (HF version)
 
+This is a 4-bit version of Sugoi 14B Ultra, quantized using GPTQModel and the VNTL-v3.1-1k dataset. This quant should work better than GGUF for certain backends like vLLM and aphrodite-engine, which excel at asynchronous prompting.
+
 Unleashing the full potential of the previous Sugoi 14B model, **Sugoi 14B Ultra** delivers nearly double the translation accuracy of its quantized predecessor, achieving a BLEU score of **21.38 vs. 13.67**. Its prompt-following skills rival those of Qwen 2.5 Base, especially when handling the bracket-heavy text commonly found in RPG Maker projects.
 
 ---
```
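The README recommends vLLM-style backends for this GPTQ quant. A minimal loading sketch under stated assumptions: the repository id `your-org/sugoi-14b-ultra-gptq` is a placeholder (the card does not name the final repo path), and the snippet assumes vLLM's built-in GPTQ support; it needs a GPU and the downloaded weights to actually run.

```python
from vllm import LLM, SamplingParams

# Placeholder repo id -- substitute the actual Hugging Face model path.
llm = LLM(
    model="your-org/sugoi-14b-ultra-gptq",
    quantization="gptq",  # tell vLLM the checkpoint is GPTQ-quantized
)

params = SamplingParams(temperature=0.2, max_tokens=256)

# vLLM batches prompts internally, which is where it outperforms
# single-stream GGUF runners for asynchronous workloads.
outputs = llm.generate(["Translate to English: こんにちは。"], params)
print(outputs[0].outputs[0].text)
```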