L3-SthenoMaidBlackroot-12.2B-V1-INSTRUCT-16fp

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.
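
As a rough illustration of what a passthrough merge does: tensors are copied from the source models without averaging or interpolation, and the selected layer ranges are stacked in the order they are listed. The sketch below is not mergekit code. The function `passthrough_stack` and the names `model_a` and `model_b` are hypothetical, and it assumes `layer_range` is half-open (start included, end excluded).

```python
def passthrough_stack(slices):
    """Return the merged layer order as (source_model, source_layer) pairs.

    `slices` is a list of (model_name, (start, end)) tuples, where the range
    is assumed to be half-open: start is included, end is excluded.
    """
    merged = []
    for model_name, (start, end) in slices:
        # Each layer is taken as-is from its source model; nothing is blended.
        merged.extend((model_name, i) for i in range(start, end))
    return merged


# Toy example with hypothetical model names.
toy_slices = [("model_a", (0, 2)), ("model_b", (1, 3))]
layer_map = passthrough_stack(toy_slices)

for out_index, (model_name, src_index) in enumerate(layer_map):
    print(f"merged layer {out_index} <- {model_name} layer {src_index}")
```

The merged model ends up deeper than either source because the slices are concatenated rather than combined layer by layer.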

Models Merged

The following models were included in the merge:

  • G:/7B/L3-SthenoMaidBlackroot-8B-V1
  • G:/7B/Meta-Llama-3-8B-Instruct

Configuration

The following YAML configuration was used to produce this model:

slices:
 - sources:
   - model: G:/7B/Meta-Llama-3-8B-Instruct
     layer_range: [0, 12]
 - sources:
   - model: G:/7B/L3-SthenoMaidBlackroot-8B-V1
     layer_range: [6, 19]
     parameters:
       scale:
         - filter: o_proj
           value: 1
         - filter: down_proj
           value: 1
         - value: 1
 - sources:
   - model: G:/7B/Meta-Llama-3-8B-Instruct
     layer_range: [12, 25]
 - sources:
   - model: G:/7B/L3-SthenoMaidBlackroot-8B-V1
     layer_range: [19, 32]
     parameters:
       scale:
         - filter: o_proj
           value: 1
         - filter: down_proj
           value: 1
         - value: 1
merge_method: passthrough
dtype: float16
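
The slice ranges above can be checked against the 12.2B in the model name with some quick arithmetic. The sketch below counts the stacked layers and estimates the parameter count. The architecture constants are assumptions taken from the published Llama 3 8B configuration (hidden size 4096, MLP size 14336, 32 attention heads, 8 key/value heads, vocabulary 128256, untied output head); none of them are stated in this card. The helper names are illustrative, and `layer_range` is assumed to be half-open.

```python
# Slices from the configuration above, as (model, (start, end)) pairs.
SLICES = [
    ("Meta-Llama-3-8B-Instruct", (0, 12)),
    ("L3-SthenoMaidBlackroot-8B-V1", (6, 19)),
    ("Meta-Llama-3-8B-Instruct", (12, 25)),
    ("L3-SthenoMaidBlackroot-8B-V1", (19, 32)),
]

# Assumed Llama 3 8B dimensions (not stated in this model card).
HIDDEN = 4096
INTERMEDIATE = 14336
N_HEADS = 32
N_KV_HEADS = 8
VOCAB = 128256


def count_layers(slices):
    """Total number of transformer layers after stacking all slices."""
    return sum(end - start for _, (start, end) in slices)


def params_per_layer():
    """Weights in one decoder layer: attention, gated MLP, two norms."""
    head_dim = HIDDEN // N_HEADS
    kv_dim = head_dim * N_KV_HEADS
    attn = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * kv_dim  # q, o and k, v
    mlp = 3 * HIDDEN * INTERMEDIATE                   # gate, up, down
    norms = 2 * HIDDEN
    return attn + mlp + norms


def total_params(n_layers):
    """Layer weights plus input embeddings, output head, and final norm."""
    embeddings = 2 * VOCAB * HIDDEN
    final_norm = HIDDEN
    return n_layers * params_per_layer() + embeddings + final_norm


n_layers = count_layers(SLICES)
n_params = total_params(n_layers)
print(n_layers, round(n_params / 1e9, 2))  # prints: 51 12.17
```

Under these assumptions the four slices contribute 12 + 13 + 13 + 13 = 51 layers, or about 12.17 billion parameters, which is consistent with the 12.2B figure. Every `scale` entry in the configuration is 1, so the copied tensors should be left numerically unchanged.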