Quants

Provided by @mradermacher

GGUF Static: https://huggingface.co/mradermacher/MT5-Gen2-gemma-2-9B-GGUF

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: zelk12/MT5-Gen2-BI-gemma-2-9B
  - model: zelk12/MT5-Gen2-MMGMAMU-gemma-2-9B
merge_method: slerp
base_model: zelk12/MT5-Gen2-BI-gemma-2-9B
dtype: bfloat16
parameters:
  t: 0.25

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 32.60
IFEval (0-Shot) 79.62
BBH (3-Shot) 44.11
MATH Lvl 5 (4-Shot) 10.35
GPQA (0-shot) 13.53
MuSR (0-shot) 10.44
MMLU-PRO (5-shot) 37.55
Downloads last month
8
Safetensors
Model size
10.2B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for zelk12/MT5-Gen2-gemma-2-9B

Evaluation results