This model is a merged model based on Qwen/Qwen2.5-7B-Instruct using a novel model merging technique.
Performance (Self-Tested on A100)
The following results are obtained using batch_size=6 on an A100 GPU. Official results are pending submission to open_llm_leaderboard
.
IFEVAL | BBH | MATH | GPQA | MUSR | MMLU-PRO | AVG |
---|---|---|---|---|---|---|
75.21 | 36.03 | 48.87 | 7.61 | 15.37 | 37.8 | 36.815 |
Note: These results will be updated once officially verified.
Recipe Coming Soon
We will release details on the merging technique and methodology soon. Stay tuned! 🚀