MoE-LLaVA: Mixture of Experts for Large Vision-Language Models Paper โข 2401.15947 โข Published Jan 29, 2024 โข 51 โข 4
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper โข 2401.14196 โข Published Jan 25, 2024 โข 60 โข 4