Adding Evaluation Results
This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr
The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.
If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions
README.md
CHANGED
@@ -1,5 +1,7 @@
 ---
-
+language:
+- en
+license: mit
 tags:
 - llama-2
 - instruct
@@ -8,14 +10,12 @@ tags:
 - gpt4
 - synthetic data
 - distillation
+base_model: NousResearch/Llama-2-13b-hf
 datasets:
 - teknium/openhermes
 model-index:
 - name: openhermes-13b
   results: []
-license: mit
-language:
-- en
 ---
 
 # OpenHermes-13B
@@ -155,4 +155,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
-- num_epochs: 3
+- num_epochs: 3
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/teknium__OpenHermes-13B-details)
+
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |12.18|
+|IFEval (0-Shot)    |26.68|
+|BBH (3-Shot)       |18.21|
+|MATH Lvl 5 (4-Shot)| 1.21|
+|GPQA (0-shot)      | 3.02|
+|MuSR (0-shot)      | 8.53|
+|MMLU-PRO (5-shot)  |15.44|
+
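The `Avg.` row in the added table appears to be the unweighted mean of the six benchmark scores. The sketch below is a quick sanity check under that assumption. The `scores` dictionary and the `mean_score` helper are illustrative names, not part of the PR or of any leaderboard tooling, and the leaderboard itself may compute the average from unrounded values.

```python
# Scores copied from the table added to README.md by this PR.
scores = {
    "IFEval (0-Shot)": 26.68,
    "BBH (3-Shot)": 18.21,
    "MATH Lvl 5 (4-Shot)": 1.21,
    "GPQA (0-shot)": 3.02,
    "MuSR (0-shot)": 8.53,
    "MMLU-PRO (5-shot)": 15.44,
}


def mean_score(values):
    """Unweighted arithmetic mean, rounded to two decimals like the table.

    Hypothetical helper: assumes every benchmark carries equal weight.
    """
    values = list(values)
    return round(sum(values) / len(values), 2)


average = mean_score(scores.values())
print(f"Avg. = {average:.2f}")  # prints "Avg. = 12.18"
```

The six scores sum to 73.09, and 73.09 / 6 ≈ 12.18, which matches the reported `Avg.` value.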