Configuration Parsing Warning: In config.json: "quantization_config.bits" must be an integer

Rombos-LLM-V2.5-Qwen-72b

image/jpeg

Rombos-LLM-V2.5-Qwen-72b is a continues finetuned version of Qwen2.5-72B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the Ties merge method

This version of the model shows higher performance than the original instruct and base models.

Quants: (Coming soon)

GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-72b-GGUF

EXL2:

Benchmarks: (Coming soon)

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 45.39
IFEval (0-Shot) 71.55
BBH (3-Shot) 61.27
MATH Lvl 5 (4-Shot) 47.58
GPQA (0-shot) 19.80
MuSR (0-shot) 17.32
MMLU-PRO (5-shot) 54.83
Downloads last month
9
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for async0x42/Rombos-LLM-V2.5-Qwen-72b-exl2_3.75bpw

Base model

Qwen/Qwen2.5-72B
Quantized
(75)
this model

Evaluation results