merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the DARE TIES merge method, with openlynn/Llama-3-Soliloquy-8B-v2 as the base model.
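DARE TIES combines two ideas: DARE randomly drops a fraction of each task vector (the difference between a fine-tuned model and the base) and rescales what survives, while TIES resolves sign conflicts between the contributing models before the weighted deltas are summed back onto the base. The snippet below is a simplified, per-tensor illustration of that idea, not mergekit's actual implementation; the function name, the toy tensors, and the simplified sign-consensus step are assumptions for demonstration only. In the configuration further down, `density` is the fraction of each task vector that is kept and `weight` scales its contribution.

```python
# Simplified per-tensor sketch of the DARE + TIES idea (illustrative only,
# not mergekit's implementation).
import torch

def dare_ties_merge(base, finetuned_list, densities, weights):
    """Merge task vectors (finetuned - base) with DARE drop/rescale and a TIES-style sign consensus."""
    deltas = []
    for ft, density, weight in zip(finetuned_list, densities, weights):
        delta = ft - base                                     # task vector
        mask = torch.bernoulli(torch.full_like(delta, density))
        delta = delta * mask / density                        # DARE: drop and rescale
        deltas.append(weight * delta)
    stacked = torch.stack(deltas)
    # TIES-style consensus: keep only contributions that agree with the dominant sign per element
    sign = torch.sign(stacked.sum(dim=0))
    agree = torch.sign(stacked) == sign
    merged_delta = (stacked * agree).sum(dim=0)
    return base + merged_delta

# Toy usage with made-up tensors
base = torch.zeros(4)
fts = [torch.randn(4), torch.randn(4)]
print(dare_ties_merge(base, fts, densities=[0.7, 0.9], weights=[0.5, 0.5]))
```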
Models Merged
The following models were included in the merge:
- cgato/L3-TheSpice-8b-v0.8.3
- NousResearch/Hermes-2-Pro-Llama-3-8B
Configuration
The following YAML configuration was used to produce this model:
base_model: openlynn/Llama-3-Soliloquy-8B-v2
dtype: bfloat16
merge_method: dare_ties
parameters:
  int8_mask: 1.0
  normalize: 0.0
slices:
- sources:
  - layer_range: [0, 4]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 1.0
      weight: 0.6861808716092435
  - layer_range: [0, 4]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.6628290134113985
      weight: 0.5815923052193855
  - layer_range: [0, 4]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 1.0
      weight: 0.5113886163963061
- sources:
  - layer_range: [4, 8]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.892655547455918
      weight: 0.038732602391021484
  - layer_range: [4, 8]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 1.0
      weight: 0.1982145486303527
  - layer_range: [4, 8]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 1.0
      weight: 0.6843011350690802
- sources:
  - layer_range: [8, 12]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 0.7817511027396784
      weight: 0.13053333213489704
  - layer_range: [8, 12]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.6963703515864826
      weight: 0.20525481492667985
  - layer_range: [8, 12]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 0.6983086326765777
      weight: 0.5843953969574106
- sources:
  - layer_range: [12, 16]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 0.9632895768462915
      weight: 0.2101146706607748
  - layer_range: [12, 16]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.597557434542081
      weight: 0.6728172621848589
  - layer_range: [12, 16]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 0.756263557607837
      weight: 0.2581423726361908
- sources:
  - layer_range: [16, 20]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 1.0
      weight: 0.2116035543552448
  - layer_range: [16, 20]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 1.0
      weight: 0.22654226422958418
  - layer_range: [16, 20]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 0.8925914810507647
      weight: 0.42243766315440867
- sources:
  - layer_range: [20, 24]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 0.7697608089825734
      weight: 0.1535118632140203
  - layer_range: [20, 24]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.9886758076773643
      weight: 0.3305040603868546
  - layer_range: [20, 24]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 1.0
      weight: 0.40670083428654535
- sources:
  - layer_range: [24, 28]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 1.0
      weight: 0.4542810478500622
  - layer_range: [24, 28]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.8330662483310117
      weight: 0.2587495367324508
  - layer_range: [24, 28]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 0.9845313983551542
      weight: 0.40378452705975915
- sources:
  - layer_range: [28, 32]
    model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 1.0
      weight: 0.2951962192288415
  - layer_range: [28, 32]
    model: cgato/L3-TheSpice-8b-v0.8.3
    parameters:
      density: 0.960315594933433
      weight: 0.13142971773782525
  - layer_range: [28, 32]
    model: openlynn/Llama-3-Soliloquy-8B-v2
    parameters:
      density: 1.0
      weight: 0.30838472094518804
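The configuration above can be fed back to mergekit to reproduce the merge. The snippet below is a minimal sketch using mergekit's Python API, assuming the configuration is saved as `merge.yaml` and an output directory name chosen for illustration; exact option names can differ between mergekit versions, and the `mergekit-yaml merge.yaml ./output-dir` command-line entry point is an equivalent route.

```python
# Sketch of re-running this merge with mergekit's Python API (version-dependent;
# assumes the YAML configuration above is saved as merge.yaml).
import yaml
import torch

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("merge.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    out_path="./L3-8B-Soliloquy-TheSpice-Hermes2pro-merge-v1",  # illustrative output path
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # use the GPU for tensor ops if one is present
        copy_tokenizer=True,             # copy the base model's tokenizer into the output
        lazy_unpickle=False,
    ),
)
```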