merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method, with bunnycore/LLama-3.1-4B-TitanFusion as the base model.
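For readers unfamiliar with Model Stock, the following is a minimal sketch of the interpolation rule as described in the Model Stock paper, applied to a single flattened tensor. It is not mergekit's implementation: the function name `model_stock_merge`, the plain-list tensor representation, and the pairwise averaging of cosines are assumptions made for illustration, and the sketch ignores any per-model `weight` values.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Illustrative Model Stock merge for one flattened tensor.

    base      -- list of floats, the base model's weights
    finetuned -- list of two or more lists of floats, one per merged model
    Returns (merged_weights, t), where t is the interpolation ratio.
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("Model Stock needs at least two fine-tuned models")

    # Task vectors: how far each fine-tuned model moved from the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm > 0 else 0.0

    # Average cosine of the angle between every pair of task vectors.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio: t = k*cos / (1 + (k-1)*cos).
    denom = 1 + (k - 1) * cos_theta
    if abs(denom) < 1e-12:
        raise ValueError("degenerate geometry: interpolation ratio is undefined")
    t = k * cos_theta / denom

    # Move from the base toward the average of the fine-tuned weights by t.
    average = [sum(column) / k for column in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(average, base)]
    return merged, t
```

When the task vectors agree closely (cosine near 1), `t` approaches 1 and the result is close to the plain average of the fine-tuned models; when they are nearly orthogonal, `t` approaches 0 and the result stays near the base.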

Models Merged

The following models were included in the merge:

- Nexesenex/Nemotron_W_4b_MagLight_0.1
- FourOhFour/Maelstrom_4B

Configuration

The following YAML configuration was used to produce this model:

merge_method: model_stock
models:
  - model: Nexesenex/Nemotron_W_4b_MagLight_0.1
    parameters:
      weight: 1.0
  - model: FourOhFour/Maelstrom_4B
    parameters:
      weight: 1.0
base_model: bunnycore/LLama-3.1-4B-TitanFusion
dtype: bfloat16
normalize: true
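To reproduce a merge from a configuration like the one above, mergekit provides the `mergekit-yaml` command-line tool. The sketch below wraps that command from Python; the helper names `build_merge_command` and `run_merge`, and the paths `config.yaml` and `./merged-model`, are hypothetical, and it assumes mergekit is installed and the configuration has been saved to a file.

```python
import shutil
import subprocess


def build_merge_command(config_path, output_dir, use_cuda=False):
    """Build the argument list for mergekit's `mergekit-yaml` CLI."""
    cmd = ["mergekit-yaml", str(config_path), str(output_dir)]
    if use_cuda:
        # Ask mergekit to perform tensor arithmetic on the GPU.
        cmd.append("--cuda")
    return cmd


def run_merge(config_path, output_dir, use_cuda=False):
    """Run the merge; raises if mergekit is missing or the merge fails."""
    if shutil.which("mergekit-yaml") is None:
        raise RuntimeError("mergekit-yaml not found on PATH; install mergekit first")
    subprocess.run(build_merge_command(config_path, output_dir, use_cuda), check=True)


# Example (hypothetical paths; downloads the listed models when executed):
# run_merge("config.yaml", "./merged-model")
```

Running the merge downloads the base model and both merged models, so it needs disk space and network access in proportion to three 4B-parameter checkpoints.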

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|-------|
| Avg.                | 15.09 |
| IFEval (0-Shot)     | 36.27 |
| BBH (3-Shot)        | 18.55 |
| MATH Lvl 5 (4-Shot) |  4.23 |
| GPQA (0-shot)       |  4.03 |
| MuSR (0-shot)       | 10.76 |
| MMLU-PRO (5-shot)   | 16.72 |
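The Avg. row is consistent with an unweighted mean of the six benchmark scores. The short check below illustrates that arithmetic; the dictionary name `scores` is chosen here for illustration, and the equal weighting is an assumption inferred from the numbers rather than something the results state.

```python
# Benchmark scores as reported in the results above.
scores = {
    "IFEval (0-Shot)": 36.27,
    "BBH (3-Shot)": 18.55,
    "MATH Lvl 5 (4-Shot)": 4.23,
    "GPQA (0-shot)": 4.03,
    "MuSR (0-shot)": 10.76,
    "MMLU-PRO (5-shot)": 16.72,
}

# Unweighted mean over the six benchmarks, rounded to two decimals.
average = round(sum(scores.values()) / len(scores), 2)
print(average)
```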
