Update README.md
README.md
CHANGED
@@ -89,7 +89,7 @@ Bitte erkläre mir, wie die Zusammenführung von Modellen durch bestehende Spitz
 ## Evaluation
 
 ### GPT4ALL:
-*Compared to
+*Compared to relevant German Closed and Open Source models*
 
 
 
@@ -104,10 +104,10 @@ Bitte erkläre mir, wie die Zusammenführung von Modellen durch bestehende Spitz
 **performed with newest Language Model Evaluation Harness*
 
 ### MMLU:
-*Compared to Grok0,Grok1,GPT3.5,GPT4*
+*Compared to Big Boy LLMs (Grok0,Grok1,GPT3.5,GPT4)*
 
 ### TruthfulQA:
-*Compared to GPT3.5,GPT4*
+*Compared to OpenAI Models (GPT3.5,GPT4)*
 
 
 ### MT-Bench (German):
@@ -170,6 +170,7 @@ SauerkrautLM-3b-v1 2.581250
 open_llama_3b_v2 1.456250
 Llama-2-7b 1.181250
 ```
+**performed with the newest FastChat Version*
 ### MT-Bench (English):
 
 ```
@@ -197,7 +198,7 @@ SauerkrautLM-7b-HerO <--- 7.409375
 Mistral-7B-OpenOrca 6.915625
 neural-chat-7b-v3-1 6.812500
 ```
-
+**performed with the newest FastChat Version*
 
 ### Additional German Benchmark results:
 
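The GPT4ALL, MMLU and TruthfulQA sections above carry the footnote that they were produced with the Language Model Evaluation Harness. As a rough illustration of what such a run looks like, here is a minimal Python sketch; the `lm_eval.simple_evaluate` entry point and its arguments follow lm-evaluation-harness v0.4.x and may differ in other versions, the task list is the commonly used GPT4ALL suite, and the model path is a placeholder, not taken from this commit.

```python
# Minimal sketch: scoring a model with the Language Model Evaluation Harness.
# Assumptions: lm-evaluation-harness v0.4.x API; the task list is the commonly
# used GPT4ALL suite; MODEL is a placeholder for the real checkpoint.
import lm_eval

MODEL = "path/to/SauerkrautLM-7b-HerO"  # placeholder checkpoint path or Hub id

results = lm_eval.simple_evaluate(
    model="hf",                                  # Hugging Face transformers backend
    model_args=f"pretrained={MODEL}",
    tasks=["arc_challenge", "arc_easy", "boolq", "hellaswag",
           "openbookqa", "piqa", "winogrande"],  # GPT4ALL average
    batch_size=8,
)

# Print the per-task metrics that feed the tables above.
for task, metrics in results["results"].items():
    print(task, metrics)
```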
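The MT-Bench footnotes added here state that the scores were produced with the newest FastChat version. As a sketch of that workflow, the three steps below wrap FastChat's `llm_judge` scripts via `subprocess`; the script names and flags are assumptions based on FastChat's documentation and may differ between versions, the commands are meant to be run from the `fastchat/llm_judge` directory, GPT-4 judging requires an OpenAI API key, and the model identifiers are placeholders.

```python
# Rough sketch of the FastChat MT-Bench workflow (fastchat/llm_judge).
# Assumptions: script names and flags follow FastChat's llm_judge docs and may
# differ between versions; MODEL_PATH and MODEL_ID are placeholders.
import subprocess

MODEL_PATH = "path/to/SauerkrautLM-7b-HerO"  # placeholder checkpoint
MODEL_ID = "SauerkrautLM-7b-HerO"            # name used in the result tables

# 1) Generate the model's answers to the MT-Bench questions.
subprocess.run(["python", "gen_model_answer.py",
                "--model-path", MODEL_PATH,
                "--model-id", MODEL_ID], check=True)

# 2) Have the GPT-4 judge grade the answers (needs OPENAI_API_KEY set).
subprocess.run(["python", "gen_judgment.py",
                "--model-list", MODEL_ID], check=True)

# 3) Show the aggregated scores, comparable to the tables above.
subprocess.run(["python", "show_result.py",
                "--model-list", MODEL_ID], check=True)
```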