Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Paper β’ 2403.13248 β’ Published Mar 20, 2024 β’ 78
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper β’ 2402.14905 β’ Published Feb 22, 2024 β’ 127
Knowledge Distillation of Large Language Models Paper β’ 2306.08543 β’ Published Jun 14, 2023 β’ 20
MambaByte: Token-free Selective State Space Model Paper β’ 2401.13660 β’ Published Jan 24, 2024 β’ 54