This model is a merge based on Qwen/Qwen2.5-7B-Instruct, produced with a novel model merging technique.

Performance (Self-Tested on A100)

The following results were obtained with batch_size=6 on an A100 GPU. Official results are pending submission to open_llm_leaderboard.

| IFEVAL | BBH | MATH | GPQA | MUSR | MMLU-PRO | AVG |
|--------|-----|------|------|------|----------|-----|
| 75.21 | 36.03 | 48.87 | 7.61 | 15.37 | 37.8 | 36.815 |
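
The AVG column appears to be the unweighted mean of the six benchmark scores. The short sketch below checks that arithmetic under that assumption; the `scores` dictionary is an illustrative name that holds the numbers from the table and is not part of any evaluation tooling.

```python
# Self-reported benchmark scores, copied from the results table.
scores = {
    "IFEVAL": 75.21,
    "BBH": 36.03,
    "MATH": 48.87,
    "GPQA": 7.61,
    "MUSR": 15.37,
    "MMLU-PRO": 37.8,
}

# Assumption: AVG is the plain arithmetic mean over all six benchmarks.
avg = sum(scores.values()) / len(scores)

print(f"AVG = {avg:.3f}")
```

Under this assumption the computed mean matches the reported AVG of 36.815.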

Note: These results will be updated once officially verified.

Recipe Coming Soon

We will release details on the merging technique and methodology soon. Stay tuned! 🚀