Files changed (1)
  1. README.md +73 -63
README.md CHANGED
@@ -1,63 +1,73 @@
1
-
2
- ---
3
-
4
- base_model:
5
- - Qwen/Qwen2.5-Coder-7B-Instruct
6
- - Qwen/Qwen2.5-7B-Instruct
7
- - Qwen/Qwen2.5-Math-7B-Instruct
8
- library_name: transformers
9
- tags:
10
- - mergekit
11
- - merge
12
-
13
-
14
- ---
15
-
16
- [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
17
-
18
-
19
- # QuantFactory/Qwen2.5-7B-Instruct-MathCoder-GGUF
20
- This is a quantized version of [DeepMount00/Qwen2.5-7B-Instruct-MathCoder](https://huggingface.co/DeepMount00/Qwen2.5-7B-Instruct-MathCoder) created using llama.cpp.
21
-
22
- # Original Model Card
23
-
24
- # merge
25
-
26
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
27
-
28
- ## Merge Details
29
- ### Merge Method
30
-
31
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
32
-
33
- ### Models Merged
34
-
35
- The following models were included in the merge:
36
- * [Qwen/Qwen2.5-Coder-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)
37
- * [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct)
38
-
39
- ### Configuration
40
-
41
- The following YAML configuration was used to produce this model:
42
-
43
- ```yaml
44
- models:
45
-   - model: Qwen/Qwen2.5-7B-Instruct
46
-     #no parameters necessary for base model
47
-   - model: Qwen/Qwen2.5-Math-7B-Instruct
48
-     parameters:
49
-       density: 0.5
50
-       weight: 0.5
51
-   - model: Qwen/Qwen2.5-Coder-7B-Instruct
52
-     parameters:
53
-       density: 0.5
54
-       weight: 0.5
55
-
56
- merge_method: ties
57
- base_model: Qwen/Qwen2.5-7B-Instruct
58
- parameters:
59
-   normalize: false
60
-   int8_mask: true
61
- dtype: float16
62
- ```
63
-
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-Coder-7B-Instruct
4
+ - Qwen/Qwen2.5-7B-Instruct
5
+ - Qwen/Qwen2.5-Math-7B-Instruct
6
+ library_name: transformers
7
+ tags:
8
+ - mergekit
9
+ - merge
10
+ language:
11
+ - zho
12
+ - eng
13
+ - fra
14
+ - spa
15
+ - por
16
+ - deu
17
+ - ita
18
+ - rus
19
+ - jpn
20
+ - kor
21
+ - vie
22
+ - tha
23
+ - ara
24
+ ---
25
+
26
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
27
+
28
+
29
+ # QuantFactory/Qwen2.5-7B-Instruct-MathCoder-GGUF
30
+ This is a quantized version of [DeepMount00/Qwen2.5-7B-Instruct-MathCoder](https://huggingface.co/DeepMount00/Qwen2.5-7B-Instruct-MathCoder) created using llama.cpp.
31
+
32
+ # Original Model Card
33
+
34
+ # merge
35
+
36
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
37
+
38
+ ## Merge Details
39
+ ### Merge Method
40
+
41
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
42
+
43
+ ### Models Merged
44
+
45
+ The following models were included in the merge:
46
+ * [Qwen/Qwen2.5-Coder-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)
47
+ * [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct)
48
+
49
+ ### Configuration
50
+
51
+ The following YAML configuration was used to produce this model:
52
+
53
+ ```yaml
54
+ models:
55
+   - model: Qwen/Qwen2.5-7B-Instruct
56
+     #no parameters necessary for base model
57
+   - model: Qwen/Qwen2.5-Math-7B-Instruct
58
+     parameters:
59
+       density: 0.5
60
+       weight: 0.5
61
+   - model: Qwen/Qwen2.5-Coder-7B-Instruct
62
+     parameters:
63
+       density: 0.5
64
+       weight: 0.5
65
+
66
+ merge_method: ties
67
+ base_model: Qwen/Qwen2.5-7B-Instruct
68
+ parameters:
69
+   normalize: false
70
+   int8_mask: true
71
+ dtype: float16
72
+ ```
73
+
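
For readers unfamiliar with the TIES method named in the card, the sketch below illustrates roughly what `density`, `weight`, and `normalize` control. It is a simplified, pure-Python illustration on flat lists of floats, not mergekit's implementation: the function names `trim` and `ties_merge` and the toy parameter values are assumptions made up for this example, and real merges operate per tensor on full model weights.

```python
def trim(delta, density):
    """Keep the `density` fraction of entries with the largest magnitude; zero the rest."""
    k = int(density * len(delta))
    if k <= 0:
        return [0.0] * len(delta)
    # Indices of the k largest-magnitude entries.
    order = sorted(range(len(delta)), key=lambda i: abs(delta[i]), reverse=True)
    keep = set(order[:k])
    return [d if i in keep else 0.0 for i, d in enumerate(delta)]


def ties_merge(base, finetuned, weights, density, normalize=False):
    """Simplified TIES-style merge (hypothetical helper, for illustration only)."""
    # 1. Task vectors: difference between each fine-tuned model and the base.
    deltas = [[f - b for f, b in zip(model, base)] for model in finetuned]
    # 2. Trim each task vector to the requested density, then apply its weight.
    weighted = [
        [w * d for d in trim(delta, density)]
        for delta, w in zip(deltas, weights)
    ]
    merged = []
    for j, b in enumerate(base):
        column = [wd[j] for wd in weighted]
        # 3. Elect a sign per parameter from the sum of the weighted deltas.
        sign = 1.0 if sum(column) >= 0 else -1.0
        # 4. Keep only contributions that agree with the elected sign.
        agree = [(v, w) for v, w in zip(column, weights) if v * sign > 0]
        update = sum(v for v, _ in agree)
        if normalize and agree:
            # Rescale by the total weight of the agreeing models.
            update /= sum(w for _, w in agree)
        merged.append(b + update)
    return merged


# Toy values standing in for the base, Math, and Coder checkpoints.
base = [0.0, 0.0, 0.0, 0.0]
math_model = [1.0, -0.2, 0.5, 0.1]
coder_model = [-0.6, 0.8, 0.1, -0.05]

merged = ties_merge(
    base,
    [math_model, coder_model],
    weights=[0.5, 0.5],
    density=0.5,
    normalize=False,
)
print(merged)
```

In this toy run the two models disagree in sign on the first parameter. The weighted sum is positive, so only the Math contribution survives there, while the smallest half of each task vector is dropped before merging.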