Qwen
Collection
Alibaba Cloud-based models
•
1246 items
•
Updated
•
5
This is a merge of pre-trained language models created using mergekit.
Qwen2.5 Instruct tied to Reasoning LORA.
This model was merged using the Passthrough merge method using unsloth/Qwen2.5-3B-Instruct + bunnycore/Qwen-2.5-3b-R1-lora_model-v.1 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
base_model: unsloth/Qwen2.5-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
dtype: bfloat16
merge_method: passthrough
models:
- model: unsloth/Qwen2.5-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
tokenizer_source: unsloth/Qwen2.5-3B
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 24.67 |
IFEval (0-Shot) | 42.14 |
BBH (3-Shot) | 27.20 |
MATH Lvl 5 (4-Shot) | 26.74 |
GPQA (0-shot) | 7.94 |
MuSR (0-shot) | 12.73 |
MMLU-PRO (5-shot) | 31.26 |