Merge

Triangle104/Chatty-Harry_V2.0 is a merge of pre-trained language models created using mergekit.


Model details:


Do I feel lucky?

V0.1 was spitting out nonsense, so I have attempted a different merge method and parameters for this version.


Merge Method

This model was merged using the TIES merge method, with anthracite-org/magnum-v4-12b as the base. In TIES, each model's weight scales its contribution to the final parameters, while density controls the fraction of that model's delta from the base that is kept before merging.

Models Merged

The following models were included in the merge:

- spow12/ChatWaifu_12B_v2.0

Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: spow12/ChatWaifu_12B_v2.0
    parameters:
      density: 0.25
      weight: 0.25
  - model: anthracite-org/magnum-v4-12b
    parameters:
      density: 0.5
      weight: 0.5

merge_method: ties
base_model: anthracite-org/magnum-v4-12b
parameters:
  normalize: false
  int8_mask: true
dtype: float16
```
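
To reproduce the merge, the configuration above can be passed to mergekit's command-line entry point. The following is a minimal sketch, assuming mergekit is installed (pip install mergekit), the YAML is saved as config.yaml, and there is enough disk space and memory for two 12B checkpoints; the output directory name is arbitrary.

```python
# Minimal sketch: re-run the merge from the saved YAML config.
# Assumes `pip install mergekit` and that the YAML above is saved as config.yaml.
import subprocess

subprocess.run(
    # Output directory name is arbitrary; add "--cuda" to merge on a GPU.
    ["mergekit-yaml", "config.yaml", "./Chatty-Harry_V2.0"],
    check=True,
)
```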

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|------:|
| Avg.                | 21.73 |
| IFEval (0-shot)     | 33.26 |
| BBH (3-shot)        | 32.76 |
| MATH Lvl 5 (4-shot) | 13.29 |
| GPQA (0-shot)       |  9.73 |
| MuSR (0-shot)       | 11.54 |
| MMLU-PRO (5-shot)   | 29.81 |
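
Beyond the leaderboard numbers, the model can be loaded like any other causal LM checkpoint on the Hub. The sketch below is a minimal usage example, not an official guide: it assumes a recent transformers and torch install, that the merged tokenizer inherits a chat template from the base model, and enough GPU memory for a 12B model in float16 (roughly 24 GB).

```python
# Minimal usage sketch; prompt text and generation settings are illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Triangle104/Chatty-Harry_V2.0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the merge dtype above
    device_map="auto",
)

messages = [{"role": "user", "content": "Do I feel lucky?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```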
