deepcogito/cogito-v1-preview-llama-8B

drishanarora committed on Apr 8

Commit 64c4236 · verified · 1 Parent(s): 162b38e

Update README.md

Files changed (1)

README.md +19 -0

@@ -22,8 +22,27 @@ The Cogito LLMs are instruction tuned generative models (text in/text out). All
   - In both standard and reasoning modes, Cogito v1-preview models outperform their size equivalent counterparts on common industry benchmarks.
 - Each model is trained in over 30 languages and supports a context length of 128k.
+# Evaluations
+We compare our models against state-of-the-art size-equivalent models in both direct mode and reasoning mode. For direct mode, we compare against the Llama / Qwen instruct counterparts. For reasoning mode, we compare against DeepSeek's R1 distilled counterparts / Qwen's QwQ model.
+<p align="left">
+  <img src="images/8b_benchmarks.png" alt="8B benchmarks" width="90%">
+</p>
+**Livebench Global Average:**
+<p align="left">
+  <img src="images/livebench_global_average.png" alt="Livebench global average" width="80%">
+</p>
+**Tool Calling:**
+<p align="left">
+  <img src="images/3b_8b_tool_calling_benchmarks.png" alt="3B and 8B tool calling benchmarks" width="90%">
+</p>
 For detailed evaluations, please refer to the [Blog Post](https://www.deepcogito.com/research/cogito-v1-preview).
 # Usage
 Here is a snippet below for usage with Transformers: