mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-SFT


huseinzol05 committed on Jun 18

Commit 0cabd35 · verified · 1 Parent(s): 48ae09f

Update README.md

Files changed (1)

README.md +29 -2


@@ -28,8 +28,6 @@ Finetune on [mesolitica/Malaysian-Reasoning](https://huggingface.co/datasets/mes
 Source code at https://github.com/mesolitica/malaya/tree/master/session/qwen2.5
-## Benchmark
 ### Dialect Translation
 All benchmark outputs are generated using vLLM and evaluated with sacrebleu chrF, max@5.
@@ -39,15 +37,44 @@ Source code for evaluation at https://github.com/mesolitica/malaya/tree/master/s
 Dialect to standard Malay,
 ```
 ```
 Standard Malay to dialect,
 ```
 ```
 ### MalayMMLU
+Accuracy@5,
+```
+```
+For comparison, the original model,
+```
+                   Model   Accuracy   shot by_letter        category
+0  Qwen2.5-1.5B-Instruct  57.306590  0shot      True            STEM
+1  Qwen2.5-1.5B-Instruct  52.862595  0shot      True        Language
+2  Qwen2.5-1.5B-Instruct  51.633420  0shot      True  Social science
+3  Qwen2.5-1.5B-Instruct  52.554569  0shot      True          Others
+4  Qwen2.5-1.5B-Instruct  57.224118  0shot      True      Humanities
+{'Social science': 6918, 'Language': 6288, 'Humanities': 4395, 'Others': 4169, 'STEM': 2443}
+Model : Qwen2.5-1.5B-Instruct
+Metric : first
+Shot : 0shot
+average accuracy 53.69842646512204
+accuracy for STEM 57.306590257879655
+accuracy for Language 52.862595419847324
+accuracy for Social science 51.633420063602195
+accuracy for Others 52.554569441112974
+accuracy for Humanities 57.22411831626849
+```
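The reported `average accuracy` for Qwen2.5-1.5B-Instruct appears to be weighted by the number of questions in each category rather than a plain mean of the five category scores. The sketch below is not part of the evaluation code; it only re-derives the figure from the numbers printed above, and the variable names are illustrative.

```python
# Per-category accuracy (percent) and question counts, copied from the output above.
accuracy = {
    "STEM": 57.306590257879655,
    "Language": 52.862595419847324,
    "Social science": 51.633420063602195,
    "Others": 52.554569441112974,
    "Humanities": 57.22411831626849,
}
counts = {
    "Social science": 6918,
    "Language": 6288,
    "Humanities": 4395,
    "Others": 4169,
    "STEM": 2443,
}

total = sum(counts.values())

# Weight each category by its share of the questions (micro-average).
weighted = sum(accuracy[c] * counts[c] for c in counts) / total

# Plain mean over the five categories (macro-average), for contrast.
unweighted = sum(accuracy.values()) / len(accuracy)

print(f"total questions:            {total}")
print(f"weighted by question count: {weighted:.4f}")
print(f"plain mean of categories:   {unweighted:.4f}")
```

The weighted value reproduces the reported 53.698, while the plain mean comes out higher (about 54.32) because the two strongest categories, STEM and Humanities, have fewer questions than Social science and Language.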
 ## Special thanks
 Special thanks to https://www.sns.com.my and Nvidia for the 8x H100 node!