MoEfied Qwen3-8B-base with attention layers 16 and 29 ablated (0-based indices)
Qwen3-8B-base
Chat template
Files info