ertghiu256
/

Qwen3-4b-tcomanr-merge-v2

@@ -1,92 +1,92 @@
----
-base_model:
-- POLARIS-Project/Polaris-4B-Preview
-- ertghiu256/qwen-3-4b-mixture-of-thought
-- Qwen/Qwen3-4B-Thinking-2507
-- ertghiu256/qwen3-multi-reasoner
-- ertghiu256/Qwen3-Hermes-4b
-- ertghiu256/deepseek-r1-0528-distilled-qwen3
-- Tesslate/UIGEN-T3-4B-Preview-MAX
-- ertghiu256/qwen3-math-reasoner
-- ValiantLabs/Qwen3-4B-Esper3
-- huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
-- ertghiu256/qwen3-4b-code-reasoning
-- ValiantLabs/Qwen3-4B-ShiningValiant3
-library_name: transformers
-tags:
-- mergekit
-- merge
----
-# Folder Baru (5)
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) as a base.
-### Models Merged
-The following models were included in the merge:
-* [POLARIS-Project/Polaris-4B-Preview](https://huggingface.co/POLARIS-Project/Polaris-4B-Preview)
-* [ertghiu256/qwen-3-4b-mixture-of-thought](https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought)
-* [ertghiu256/qwen3-multi-reasoner](https://huggingface.co/ertghiu256/qwen3-multi-reasoner)
-* [ertghiu256/Qwen3-Hermes-4b](https://huggingface.co/ertghiu256/Qwen3-Hermes-4b)
-* [ertghiu256/deepseek-r1-0528-distilled-qwen3](https://huggingface.co/ertghiu256/deepseek-r1-0528-distilled-qwen3)
-* [Tesslate/UIGEN-T3-4B-Preview-MAX](https://huggingface.co/Tesslate/UIGEN-T3-4B-Preview-MAX)
-* [ertghiu256/qwen3-math-reasoner](https://huggingface.co/ertghiu256/qwen3-math-reasoner)
-* [ValiantLabs/Qwen3-4B-Esper3](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3)
-* [huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated)
-* [ertghiu256/qwen3-4b-code-reasoning](https://huggingface.co/ertghiu256/qwen3-4b-code-reasoning)
-* [ValiantLabs/Qwen3-4B-ShiningValiant3](https://huggingface.co/ValiantLabs/Qwen3-4B-ShiningValiant3)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-models:
-  - model: ertghiu256/qwen3-math-reasoner
-    parameters:
-      weight: 0.8
-  - model: ertghiu256/qwen3-4b-code-reasoning
-    parameters:
-      weight: 0.8
-  - model: ertghiu256/qwen-3-4b-mixture-of-thought
-    parameters:
-      weight: 0.9
-  - model: POLARIS-Project/Polaris-4B-Preview
-    parameters:
-      weight: 0.7
-  - model: ertghiu256/qwen3-multi-reasoner
-    parameters:
-      weight: 0.8
-  - model: ertghiu256/Qwen3-Hermes-4b
-    parameters:
-      weight: 0.4
-  - model: ValiantLabs/Qwen3-4B-Esper3
-    parameters:
-      weight: 0.8
-  - model: Tesslate/UIGEN-T3-4B-Preview-MAX
-    parameters:
-      weight: 0.8
-  - model: ValiantLabs/Qwen3-4B-ShiningValiant3
-    parameters:
-      weight: 0.7
-  - model: ertghiu256/deepseek-r1-0528-distilled-qwen3
-    parameters:
-      weight: 0.2
-  - model: huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
-    parameters:
-      weight: 0.6
-merge_method: ties
-base_model: Qwen/Qwen3-4B-Thinking-2507
-parameters:
-  normalize: true
-  int8_mask: true
-  lambda: 1.0
-dtype: float16
-```

+---
+base_model:
+- POLARIS-Project/Polaris-4B-Preview
+- ertghiu256/qwen-3-4b-mixture-of-thought
+- Qwen/Qwen3-4B-Thinking-2507
+- ertghiu256/qwen3-multi-reasoner
+- ertghiu256/Qwen3-Hermes-4b
+- ertghiu256/deepseek-r1-0528-distilled-qwen3
+- Tesslate/UIGEN-T3-4B-Preview-MAX
+- ertghiu256/qwen3-math-reasoner
+- ValiantLabs/Qwen3-4B-Esper3
+- huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
+- ertghiu256/qwen3-4b-code-reasoning
+- ValiantLabs/Qwen3-4B-ShiningValiant3
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# Ties merged COde MAth aNd Reasoning model
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) as a base.
+### Models Merged
+The following models were included in the merge:
+* [POLARIS-Project/Polaris-4B-Preview](https://huggingface.co/POLARIS-Project/Polaris-4B-Preview)
+* [ertghiu256/qwen-3-4b-mixture-of-thought](https://huggingface.co/ertghiu256/qwen-3-4b-mixture-of-thought)
+* [ertghiu256/qwen3-multi-reasoner](https://huggingface.co/ertghiu256/qwen3-multi-reasoner)
+* [ertghiu256/Qwen3-Hermes-4b](https://huggingface.co/ertghiu256/Qwen3-Hermes-4b)
+* [ertghiu256/deepseek-r1-0528-distilled-qwen3](https://huggingface.co/ertghiu256/deepseek-r1-0528-distilled-qwen3)
+* [Tesslate/UIGEN-T3-4B-Preview-MAX](https://huggingface.co/Tesslate/UIGEN-T3-4B-Preview-MAX)
+* [ertghiu256/qwen3-math-reasoner](https://huggingface.co/ertghiu256/qwen3-math-reasoner)
+* [ValiantLabs/Qwen3-4B-Esper3](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3)
+* [huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated)
+* [ertghiu256/qwen3-4b-code-reasoning](https://huggingface.co/ertghiu256/qwen3-4b-code-reasoning)
+* [ValiantLabs/Qwen3-4B-ShiningValiant3](https://huggingface.co/ValiantLabs/Qwen3-4B-ShiningValiant3)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: ertghiu256/qwen3-math-reasoner
+    parameters:
+      weight: 0.8
+  - model: ertghiu256/qwen3-4b-code-reasoning
+    parameters:
+      weight: 0.8
+  - model: ertghiu256/qwen-3-4b-mixture-of-thought
+    parameters:
+      weight: 0.9
+  - model: POLARIS-Project/Polaris-4B-Preview
+    parameters:
+      weight: 0.7
+  - model: ertghiu256/qwen3-multi-reasoner
+    parameters:
+      weight: 0.8
+  - model: ertghiu256/Qwen3-Hermes-4b
+    parameters:
+      weight: 0.4
+  - model: ValiantLabs/Qwen3-4B-Esper3
+    parameters:
+      weight: 0.8
+  - model: Tesslate/UIGEN-T3-4B-Preview-MAX
+    parameters:
+      weight: 0.8
+  - model: ValiantLabs/Qwen3-4B-ShiningValiant3
+    parameters:
+      weight: 0.7
+  - model: ertghiu256/deepseek-r1-0528-distilled-qwen3
+    parameters:
+      weight: 0.2
+  - model: huihui-ai/Huihui-Qwen3-4B-Thinking-2507-abliterated
+    parameters:
+      weight: 0.6
+merge_method: ties
+base_model: Qwen/Qwen3-4B-Thinking-2507
+parameters:
+  normalize: true
+  int8_mask: true
+  lambda: 1.0
+dtype: float16
+```