KRDModel

KRDModel is a merge of the following models, created with mergekit:

- prithivMLmods/Qwen2.5-14B-DeepSeek-R1-1M
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B

🧩 Configuration

slices:
- sources:
  - model: prithivMLmods/Qwen2.5-14B-DeepSeek-R1-1M
    layer_range:
    - 0
    - 32
  - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
    layer_range:
    - 0
    - 32
merge_method: slerp
base_model: prithivMLmods/Qwen2.5-14B-DeepSeek-R1-1M
parameters:
  t:
  - filter: self_attn
    value:
    - 0
    - 0.5
    - 0.3
    - 0.7
    - 1
  - filter: mlp
    value:
    - 1
    - 0.5
    - 0.7
    - 0.3
    - 0
  - value: 0.5
dtype: bfloat16
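
To make the `slerp` settings above easier to read, here is a minimal sketch of what the configuration asks for. It is not mergekit's code. It assumes that each list under `t` is a set of anchor values spread evenly over the layer range and linearly interpolated per layer, and that `t = 0` returns the base model's tensor while `t = 1` returns the other model's. The helper names `slerp` and `layer_t` are hypothetical and exist only for this illustration.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two equal-length vectors.

    t = 0 returns v0 (assumed here to be the base model's weights),
    t = 1 returns v1 (the other model's weights).
    """
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    if norm0 < eps or norm1 < eps:
        return lerp  # direction is undefined for a zero vector
    # Angle between the two vectors, with the cosine clamped for safety.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        return lerp  # nearly colinear vectors: plain lerp is stable
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Interpolate a list of anchor values linearly across the layers.

    Assumption: anchors are evenly spaced from the first to the last layer.
    """
    if num_layers == 1 or len(anchors) == 1:
        return float(anchors[0])
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


# Anchor lists copied from the configuration above.
SELF_ATTN_T = [0, 0.5, 0.3, 0.7, 1]
MLP_T = [1, 0.5, 0.7, 0.3, 0]
DEFAULT_T = 0.5  # applied to tensors matching neither filter

if __name__ == "__main__":
    for layer in (0, 8, 16, 24, 31):
        print(layer,
              round(layer_t(SELF_ATTN_T, layer, 32), 3),
              round(layer_t(MLP_T, layer, 32), 3))
```

Under these assumptions, the attention weights start at the base model in the first layer and end at the second model in the last layer, the MLP weights follow the opposite schedule, and all remaining tensors are blended halfway.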