Update README.md
Browse files
README.md
CHANGED
@@ -2,4 +2,47 @@
|
|
2 |
license: other
|
3 |
license_name: deepseek
|
4 |
license_link: https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICENSE-MODEL
|
|
|
|
|
5 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: other
|
3 |
license_name: deepseek
|
4 |
license_link: https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICENSE-MODEL
|
5 |
+
tags:
|
6 |
+
- merge
|
7 |
---
|
8 |
+
Everyone-Coder-33b-Base
|
9 |
+
|
10 |
+
|
11 |
+
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/ECrHQnZnv8UM9GUCQtlWW.jpeg)
|
12 |
+
|
13 |
+
EveryoneLLM series of models made by the community, for the community. This is a coding specific model made using fine-tunes of deekseekcoder-33b-base.
|
14 |
+
|
15 |
+
The models that were used in this merger were as follow:
|
16 |
+
|
17 |
+
- https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct
|
18 |
+
|
19 |
+
- https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B
|
20 |
+
|
21 |
+
- https://huggingface.co/WizardLM/WizardCoder-33B-V1.1
|
22 |
+
|
23 |
+
Thank you to the creators of the above ai models, they have full credit for the EveryoneLLM series of models. Without their hard work we wouldnt be able to achieve the great success we have in the open source community. 💗
|
24 |
+
|
25 |
+
You can find the write up for merging models here:
|
26 |
+
|
27 |
+
https://docs.google.com/document/d/1_vOftBnrk9NRk5h10UqrfJ5CDih9KBKL61yvrZtVWPE/edit?usp=sharing
|
28 |
+
|
29 |
+
Config for the merger can be found bellow:
|
30 |
+
|
31 |
+
```yaml
|
32 |
+
models:
|
33 |
+
- model: WizardLM_WizardCoder-33B-V1.1
|
34 |
+
parameters:
|
35 |
+
density: 1
|
36 |
+
weight: .5
|
37 |
+
- model: codefuse-ai_CodeFuse-DeepSeek-33B
|
38 |
+
parameters:
|
39 |
+
density: 1
|
40 |
+
weight: .5
|
41 |
+
merge_method: ties
|
42 |
+
base_model: deepseek-ai_deepseek-coder-33b-instruct
|
43 |
+
parameters:
|
44 |
+
normalize: true
|
45 |
+
int8_mask: true
|
46 |
+
dtype: float16
|
47 |
+
|
48 |
+
```
|