ertghiu256 commited on
Commit
f7218dc
·
verified ·
1 Parent(s): f130b05

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +92 -92
README.md CHANGED
@@ -1,92 +1,92 @@
1
- ---
2
- base_model:
3
- - POLARIS-Project/Polaris-4B-Preview
4
- - ertghiu256/qwen-3-4b-mixture-of-thought
5
- - Qwen/Qwen3-4B-Thinking-2507
6
- - ertghiu256/qwen3-multi-reasoner
7
- - ertghiu256/Qwen3-Hermes-4b
8
- - ertghiu256/deepseek-r1-0528-distilled-qwen3
9
- - Tesslate/UIGEN-T3-4B-Preview-MAX
10
- - ertghiu256/qwen3-math-reasoner
11
- - ValiantLabs/Qwen3-4B-Esper3
12
- - huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
13
- - ertghiu256/qwen3-4b-code-reasoning
14
- - ValiantLabs/Qwen3-4B-ShiningValiant3
15
- library_name: transformers
16
- tags:
17
- - mergekit
18
- - merge
19
-
20
- ---
21
- # Folder Baru (5)
22
-
23
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
24
-
25
- ## Merge Details
26
- ### Merge Method
27
-
28
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) as a base.
29
-
30
- ### Models Merged
31
-
32
- The following models were included in the merge:
33
- * [POLARIS-Project/Polaris-4B-Preview](https://huggingface.co/POLARIS-Project/Polaris-4B-Preview)
34
- * [ertghiu256/qwen-3-4b-mixture-of-thought](https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought)
35
- * [ertghiu256/qwen3-multi-reasoner](https://huggingface.co/ertghiu256/qwen3-multi-reasoner)
36
- * [ertghiu256/Qwen3-Hermes-4b](https://huggingface.co/ertghiu256/Qwen3-Hermes-4b)
37
- * [ertghiu256/deepseek-r1-0528-distilled-qwen3](https://huggingface.co/ertghiu256/deepseek-r1-0528-distilled-qwen3)
38
- * [Tesslate/UIGEN-T3-4B-Preview-MAX](https://huggingface.co/Tesslate/UIGEN-T3-4B-Preview-MAX)
39
- * [ertghiu256/qwen3-math-reasoner](https://huggingface.co/ertghiu256/qwen3-math-reasoner)
40
- * [ValiantLabs/Qwen3-4B-Esper3](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3)
41
- * [huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated)
42
- * [ertghiu256/qwen3-4b-code-reasoning](https://huggingface.co/ertghiu256/qwen3-4b-code-reasoning)
43
- * [ValiantLabs/Qwen3-4B-ShiningValiant3](https://huggingface.co/ValiantLabs/Qwen3-4B-ShiningValiant3)
44
-
45
- ### Configuration
46
-
47
- The following YAML configuration was used to produce this model:
48
-
49
- ```yaml
50
- models:
51
- - model: ertghiu256/qwen3-math-reasoner
52
- parameters:
53
- weight: 0.8
54
- - model: ertghiu256/qwen3-4b-code-reasoning
55
- parameters:
56
- weight: 0.8
57
- - model: ertghiu256/qwen-3-4b-mixture-of-thought
58
- parameters:
59
- weight: 0.9
60
- - model: POLARIS-Project/Polaris-4B-Preview
61
- parameters:
62
- weight: 0.7
63
- - model: ertghiu256/qwen3-multi-reasoner
64
- parameters:
65
- weight: 0.8
66
- - model: ertghiu256/Qwen3-Hermes-4b
67
- parameters:
68
- weight: 0.4
69
- - model: ValiantLabs/Qwen3-4B-Esper3
70
- parameters:
71
- weight: 0.8
72
- - model: Tesslate/UIGEN-T3-4B-Preview-MAX
73
- parameters:
74
- weight: 0.8
75
- - model: ValiantLabs/Qwen3-4B-ShiningValiant3
76
- parameters:
77
- weight: 0.7
78
- - model: ertghiu256/deepseek-r1-0528-distilled-qwen3
79
- parameters:
80
- weight: 0.2
81
- - model: huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
82
- parameters:
83
- weight: 0.6
84
- merge_method: ties
85
- base_model: Qwen/Qwen3-4B-Thinking-2507
86
- parameters:
87
- normalize: true
88
- int8_mask: true
89
- lambda: 1.0
90
- dtype: float16
91
-
92
- ```
 
1
+ ---
2
+ base_model:
3
+ - POLARIS-Project/Polaris-4B-Preview
4
+ - ertghiu256/qwen-3-4b-mixture-of-thought
5
+ - Qwen/Qwen3-4B-Thinking-2507
6
+ - ertghiu256/qwen3-multi-reasoner
7
+ - ertghiu256/Qwen3-Hermes-4b
8
+ - ertghiu256/deepseek-r1-0528-distilled-qwen3
9
+ - Tesslate/UIGEN-T3-4B-Preview-MAX
10
+ - ertghiu256/qwen3-math-reasoner
11
+ - ValiantLabs/Qwen3-4B-Esper3
12
+ - huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
13
+ - ertghiu256/qwen3-4b-code-reasoning
14
+ - ValiantLabs/Qwen3-4B-ShiningValiant3
15
+ library_name: transformers
16
+ tags:
17
+ - mergekit
18
+ - merge
19
+
20
+ ---
21
+ # Ties merged COde MAth aNd Reasoning model
22
+
23
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
24
+
25
+ ## Merge Details
26
+ ### Merge Method
27
+
28
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) as a base.
29
+
30
+ ### Models Merged
31
+
32
+ The following models were included in the merge:
33
+ * [POLARIS-Project/Polaris-4B-Preview](https://huggingface.co/POLARIS-Project/Polaris-4B-Preview)
34
+ * [ertghiu256/qwen-3-4b-mixture-of-thought](https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought)
35
+ * [ertghiu256/qwen3-multi-reasoner](https://huggingface.co/ertghiu256/qwen3-multi-reasoner)
36
+ * [ertghiu256/Qwen3-Hermes-4b](https://huggingface.co/ertghiu256/Qwen3-Hermes-4b)
37
+ * [ertghiu256/deepseek-r1-0528-distilled-qwen3](https://huggingface.co/ertghiu256/deepseek-r1-0528-distilled-qwen3)
38
+ * [Tesslate/UIGEN-T3-4B-Preview-MAX](https://huggingface.co/Tesslate/UIGEN-T3-4B-Preview-MAX)
39
+ * [ertghiu256/qwen3-math-reasoner](https://huggingface.co/ertghiu256/qwen3-math-reasoner)
40
+ * [ValiantLabs/Qwen3-4B-Esper3](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3)
41
+ * [huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated)
42
+ * [ertghiu256/qwen3-4b-code-reasoning](https://huggingface.co/ertghiu256/qwen3-4b-code-reasoning)
43
+ * [ValiantLabs/Qwen3-4B-ShiningValiant3](https://huggingface.co/ValiantLabs/Qwen3-4B-ShiningValiant3)
44
+
45
+ ### Configuration
46
+
47
+ The following YAML configuration was used to produce this model:
48
+
49
+ ```yaml
50
+ models:
51
+ - model: ertghiu256/qwen3-math-reasoner
52
+ parameters:
53
+ weight: 0.8
54
+ - model: ertghiu256/qwen3-4b-code-reasoning
55
+ parameters:
56
+ weight: 0.8
57
+ - model: ertghiu256/qwen-3-4b-mixture-of-thought
58
+ parameters:
59
+ weight: 0.9
60
+ - model: POLARIS-Project/Polaris-4B-Preview
61
+ parameters:
62
+ weight: 0.7
63
+ - model: ertghiu256/qwen3-multi-reasoner
64
+ parameters:
65
+ weight: 0.8
66
+ - model: ertghiu256/Qwen3-Hermes-4b
67
+ parameters:
68
+ weight: 0.4
69
+ - model: ValiantLabs/Qwen3-4B-Esper3
70
+ parameters:
71
+ weight: 0.8
72
+ - model: Tesslate/UIGEN-T3-4B-Preview-MAX
73
+ parameters:
74
+ weight: 0.8
75
+ - model: ValiantLabs/Qwen3-4B-ShiningValiant3
76
+ parameters:
77
+ weight: 0.7
78
+ - model: ertghiu256/deepseek-r1-0528-distilled-qwen3
79
+ parameters:
80
+ weight: 0.2
81
+ - model: huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
82
+ parameters:
83
+ weight: 0.6
84
+ merge_method: ties
85
+ base_model: Qwen/Qwen3-4B-Thinking-2507
86
+ parameters:
87
+ normalize: true
88
+ int8_mask: true
89
+ lambda: 1.0
90
+ dtype: float16
91
+
92
+ ```