MERGE2
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged with the Linear DELLA merge method, with nbeerbower/Llama-3.1-Nemotron-lorablated-70B as the base model.
Models Merged
The following models were included in the merge:
- knifeayumu/Negative-Anubis-70B-v1
- TareksLab/Transgression-V2-LLaMa-70B
- Mawdistical/Wanton-Wolf-70B
- TheDrummer/Fallen-Llama-3.3-R1-70B-v1
- allura-org/Bigger-Body-70b
Configuration
The following YAML configuration was used to produce this model:
models:
  - model: TheDrummer/Fallen-Llama-3.3-R1-70B-v1
    parameters:
      weight: 0.20
      density: 0.7
  - model: TareksLab/Transgression-V2-LLaMa-70B
    parameters:
      weight: 0.20
      density: 0.7
  - model: Mawdistical/Wanton-Wolf-70B
    parameters:
      weight: 0.20
      density: 0.7
  - model: allura-org/Bigger-Body-70b
    parameters:
      weight: 0.20
      density: 0.7
  - model: knifeayumu/Negative-Anubis-70B-v1
    parameters:
      weight: 0.20
      density: 0.7
merge_method: della_linear
base_model: nbeerbower/Llama-3.1-Nemotron-lorablated-70B
parameters:
  epsilon: 0.2
  lambda: 1.1
  normalize: false
dtype: float32
out_dtype: bfloat16
chat_template: llama3
tokenizer:
  source: Mawdistical/Wanton-Wolf-70B
  pad_to_multiple_of: 8
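To make the weight arithmetic in this configuration concrete, here is a minimal Python sketch of the linear step only. It assumes the merge adds a weighted sum of task vectors (each model minus the base), scaled by lambda, back onto the base. It deliberately omits the magnitude-based pruning controlled by density and epsilon, and it uses single made-up scalars in place of real weight tensors. The function name merge_linear and the sample values are hypothetical, not part of mergekit.

```python
# Toy illustration only: scalars stand in for full weight tensors, and the
# magnitude-based pruning step of DELLA (density, epsilon) is left out.

WEIGHTS = [0.20, 0.20, 0.20, 0.20, 0.20]  # per-model `weight` from the config
LAMBDA = 1.1                               # `lambda` from the config


def merge_linear(base, finetuned, weights, lam, normalize=False):
    """Combine task vectors (finetuned - base) linearly and add them to base."""
    deltas = [value - base for value in finetuned]
    combined = sum(w * d for w, d in zip(weights, deltas))
    if normalize:
        # Rescale so the weights effectively sum to 1.
        combined /= sum(weights)
    return base + lam * combined


# Hypothetical values for one parameter in the base and the five merged models.
base_value = 1.0
donor_values = [1.5, 0.5, 1.2, 0.8, 2.0]

merged = merge_linear(base_value, donor_values, WEIGHTS, LAMBDA)
print("sum of weights:", round(sum(WEIGHTS), 6))
print("merged value:", round(merged, 6))
```

Under this simplified reading, the five equal weights already sum to 1.0, so setting normalize to false or true would give the same result here, and the lambda of 1.1 pushes the combined difference slightly further from the base.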