switch-base-128_qmoe
This is the google/switch-base-128 model quantized with the QMoE framework to ternary precision and stored in the custom further compressed QMoE format.
Please see the QMoE repository for how to use this model.
- Downloads last month
- 1
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support