drishanarora commited on
Commit
64c4236
·
verified ·
1 Parent(s): 162b38e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md CHANGED
@@ -22,8 +22,27 @@ The Cogito LLMs are instruction tuned generative models (text in/text out). All
22
  - In both standard and reasoning modes, Cogito v1-preview models outperform their size equivalent counterparts on common industry benchmarks.
23
  - Each model is trained in over 30 languages and supports a context length of 128k.
24
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
  For detailed evaluations, please refer to the [Blog Post](https://www.deepcogito.com/research/cogito-v1-preview).
26
 
 
27
  # Usage
28
  Here is a snippet below for usage with Transformers:
29
 
 
22
  - In both standard and reasoning modes, Cogito v1-preview models outperform their size equivalent counterparts on common industry benchmarks.
23
  - Each model is trained in over 30 languages and supports a context length of 128k.
24
 
25
+
26
+ # Evaluations
27
+ We compare our models against the state of the art size equivalent models in direct mode as well as the reasoning mode. For the direct mode, we compare against Llama / Qwen instruct counterparts. For reasoning, we use Deepseek's R1 distilled counterparts / Qwen's QwQ model.
28
+
29
+ <p align="left">
30
+ <img src="images/8b_benchmarks.png" alt="Logo" width="90%">
31
+ </p>
32
+
33
+ **Livebench Global Average:**
34
+ <p align="left">
35
+ <img src="images/livebench_global_average.png" alt="Logo" width="80%">
36
+ </p>
37
+
38
+ **Tool Calling:**
39
+ <p align="left">
40
+ <img src="images/3b_8b_tool_calling_benchmarks.png" alt="Logo" width="90%">
41
+ </p>
42
+
43
  For detailed evaluations, please refer to the [Blog Post](https://www.deepcogito.com/research/cogito-v1-preview).
44
 
45
+
46
  # Usage
47
  Here is a snippet below for usage with Transformers:
48