Qwen3-8B-A1.4B-base-unhealed

MoEfied Qwen3-8B-base, and ablated attn layer 16, 29 (by 0-based index)

Downloads last month
1,465
Safetensors
Model size
7.81B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support