Merge

Triangle104/Chatty-Harry_V2.0 is a merge of pre-trained language models created using mergekit.


Model details:


Do I feel lucky?

V0.1 was spitting out nonsense, so I have attempted a different merge method and parameters for this version.


Merge Method

This model was merged using the TIES merge method, with anthracite-org/magnum-v4-12b as the base. In TIES, each model's weight scales its contribution to the final parameters, while density controls the fraction of that model's delta from the base that is kept before merging.

Models Merged

The following models were included in the merge:

- spow12/ChatWaifu_12B_v2.0

Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: spow12/ChatWaifu_12B_v2.0
    parameters:
      density: 0.25
      weight: 0.25
  - model: anthracite-org/magnum-v4-12b
    parameters:
      density: 0.5
      weight: 0.5

merge_method: ties
base_model: anthracite-org/magnum-v4-12b
parameters:
  normalize: false
  int8_mask: true
dtype: float16
```
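
To reproduce the merge, the configuration above can be passed to mergekit's command-line entry point. The following is a minimal sketch, assuming mergekit is installed (pip install mergekit), the YAML is saved as config.yaml, and there is enough disk space and memory for two 12B checkpoints; the output directory name is arbitrary.

```python
# Minimal sketch: re-run the merge from the saved YAML config.
# Assumes `pip install mergekit` and that the YAML above is saved as config.yaml.
import subprocess

subprocess.run(
    # Output directory name is arbitrary; add "--cuda" to merge on a GPU.
    ["mergekit-yaml", "config.yaml", "./Chatty-Harry_V2.0"],
    check=True,
)
```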

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|------:|
| Avg.                | 21.73 |
| IFEval (0-shot)     | 33.26 |
| BBH (3-shot)        | 32.76 |
| MATH Lvl 5 (4-shot) | 13.29 |
| GPQA (0-shot)       |  9.73 |
| MuSR (0-shot)       | 11.54 |
| MMLU-PRO (5-shot)   | 29.81 |
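
Beyond the leaderboard numbers, the model can be loaded like any other causal LM checkpoint on the Hub. The sketch below is a minimal usage example, not an official guide: it assumes a recent transformers and torch install, that the merged tokenizer inherits a chat template from the base model, and enough GPU memory for a 12B model in float16 (roughly 24 GB).

```python
# Minimal usage sketch; prompt text and generation settings are illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Triangle104/Chatty-Harry_V2.0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the merge dtype above
    device_map="auto",
)

messages = [{"role": "user", "content": "Do I feel lucky?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```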
