view article Article Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity By AviSoori1x • Mar 18, 2024 • 11
view article Article Using LoRA for Efficient Stable Diffusion Fine-Tuning By pcuenq and 1 other • Jan 26, 2023 • 66
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation Paper • 2402.14874 • Published Feb 21, 2024 • 4