Files changed (1) hide show
  1. README.md +18 -5
README.md CHANGED
@@ -1,5 +1,7 @@
1
  ---
2
- base_model: NousResearch/Llama-2-13b-hf
 
 
3
  tags:
4
  - llama-2
5
  - instruct
@@ -8,14 +10,12 @@ tags:
8
  - gpt4
9
  - synthetic data
10
  - distillation
 
11
  datasets:
12
  - teknium/openhermes
13
  model-index:
14
  - name: openhermes-13b
15
  results: []
16
- license: mit
17
- language:
18
- - en
19
  ---
20
 
21
  # OpenHermes-13B
@@ -155,4 +155,17 @@ The following hyperparameters were used during training:
155
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
156
  - lr_scheduler_type: cosine
157
  - lr_scheduler_warmup_steps: 300
158
- - num_epochs: 3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ license: mit
5
  tags:
6
  - llama-2
7
  - instruct
 
10
  - gpt4
11
  - synthetic data
12
  - distillation
13
+ base_model: NousResearch/Llama-2-13b-hf
14
  datasets:
15
  - teknium/openhermes
16
  model-index:
17
  - name: openhermes-13b
18
  results: []
 
 
 
19
  ---
20
 
21
  # OpenHermes-13B
 
155
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
156
  - lr_scheduler_type: cosine
157
  - lr_scheduler_warmup_steps: 300
158
+ - num_epochs: 3
159
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
160
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/teknium__OpenHermes-13B-details)
161
+
162
+ | Metric |Value|
163
+ |-------------------|----:|
164
+ |Avg. |12.18|
165
+ |IFEval (0-Shot) |26.68|
166
+ |BBH (3-Shot) |18.21|
167
+ |MATH Lvl 5 (4-Shot)| 1.21|
168
+ |GPQA (0-shot) | 3.02|
169
+ |MuSR (0-shot) | 8.53|
170
+ |MMLU-PRO (5-shot) |15.44|
171
+