Multimodal Foundation Models: From Specialists to General-Purpose Assistants Paper • 2309.10020 • Published Sep 18, 2023 • 40
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference Paper • 2308.12066 • Published Aug 23, 2023 • 4
Finding Neurons in a Haystack: Case Studies with Sparse Probing Paper • 2305.01610 • Published May 2, 2023 • 2
Are Emergent Abilities in Large Language Models just In-Context Learning? Paper • 2309.01809 • Published Sep 4, 2023 • 3
Schema-learning and rebinding as mechanisms of in-context learning and emergence Paper • 2307.01201 • Published Jun 16, 2023 • 2
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning Paper • 2312.15685 • Published Dec 25, 2023 • 17