DynMoE model checkpoints and paper on huggingface
-
LINs-lab/DynMoE-StableLM-1.6B
Text Generation • 3B • Updated • 21 • 2 -
LINs-lab/DynMoE-Qwen-1.8B
Text Generation • 3B • Updated • 23 • 2 -
LINs-lab/DynMoE-Phi-2-2.7B
Text Generation • 6B • Updated • 19 • 4 -
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
Paper • 2405.14297 • Published • 2