Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,15 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
---
|
4 |
+
```python
|
5 |
+
# To use the model
|
6 |
+
from transformers import AutoModelForCausalLM
|
7 |
+
model = AutoModelForCausalLM.from_pretrained("Shengkun/Qwen3-16B-A2B-Pruned")
|
8 |
+
```
|
9 |
+
|
10 |
+
**16B**
|
11 |
+
|
12 |
+
| Model | Method | Param. | SciQ | PIQA | WG | ArcE | ArcC | HS | LogiQA | BoolQ | MMLU | Avg |
|
13 |
+
|-----------------|------------------------|--------|------|------|------|------|------|------|--------|-------|------|------|
|
14 |
+
| **Qwen3-30B-A3B** | **Dense** | 30B A3B | 97.0 | 79.7 | 71.5 | 79.7 | 68.8 | 77.8 | 34.7 | 88.8 | 79.6 | 75.2 |
|
15 |
+
| | **Uniform** | 16B A2B | 94.9 | 71.4 | 60.2 | 73.2 | 52.6 | 47.0 | 33.2 | 75.0 | 55.6 | 62.5 |
|