MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 โข 9 items โข Updated Nov 27, 2024 โข 103
๐ช SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos โข 12 items โข Updated 27 days ago โข 209
view article Article How I train a LoRA: m3lt style training overview By alvdansen โข Jul 1, 2024 โข 49
view article Article Recommendation to Revisit the Diffuser Default LoRA Parameters By alvdansen โข Jun 21, 2024 โข 11
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 โข 173
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper โข 2404.03715 โข Published Apr 4, 2024 โข 61
Open LLM Leaderboard best models โค๏ธโ๐ฅ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: โข 65 items โข Updated about 1 hour ago โข 508