sometimesanotion
/

Qwenvergence-14B-v11

Text Generation

text-generation-inference

Model card Files Files and versions Community

sometimesanotion commited on Jan 30

Commit

d313417

·

verified ·

1 Parent(s): 0da7ee0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ tags:
 ---
 # Notes
-For a model_stock merge, this has greatly exceeded my expectations.  It beats Lamarck v0.7's average without introducing DeepSeek elements, mostly by scoring high on MATH without giving up much elsewhere.  It also shows that the high-scoring Qwen2.5 14B merges are converging near the limits of the architecture.
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/665fef5a4794222f6a2fe605/Vj2f_8kD9GBeWr0SEj9qd.png)

 ---
 # Notes
+For a model_stock merge, this has greatly exceeded my expectations.  It beats Lamarck v0.7's average without introducing DeepSeek elements, mostly by scoring high on MATH without giving up much elsewhere.  It also shows that the high-scoring Qwen2.5 14B merges are converging near the limits of the architecture.  Here is how it benchmarks alongside the models it merges.
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/665fef5a4794222f6a2fe605/Vj2f_8kD9GBeWr0SEj9qd.png)