Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Paper • 2403.13248 • Published Mar 20, 2024 • 78
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 133
Knowledge Distillation of Large Language Models Paper • 2306.08543 • Published Jun 14, 2023 • 20
MambaByte: Token-free Selective State Space Model Paper • 2401.13660 • Published Jan 24, 2024 • 61