XReasoning - models
Collection
https://arxiv.org/abs/2505.22888
ds - means continue post-training on deepseek distilled qwen math 7b
limo-{language}-{amount of data}
•
19 items
•
Updated
•
1
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Linear merge method.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
parameters:
weight: 1.0
- model: shanchen/ds-limo-te-250
parameters:
weight: 0.5
merge_method: linear
dtype: float16