SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M ā¢ 8 items ā¢ Updated 10 days ago ā¢ 163
view article Article š®š¹šÆšµš§š· Generating multilingual instruction datasets with Magpie š¦āā¬ By anakin87 ā¢ 24 days ago ā¢ 18
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper ā¢ 2405.01535 ā¢ Published May 2 ā¢ 116
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. ā¢ 26 items ā¢ Updated about 11 hours ago ā¢ 492
view article Article š¤ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 ā¢ 37
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. ā¢ 6 items ā¢ Updated 30 days ago ā¢ 135
Qwen2-VL Collection Vision-language model series based on Qwen2 ā¢ 15 items ā¢ Updated Sep 18 ā¢ 151
Qwen2-Audio Collection Audio-language model series based on Qwen2 ā¢ 4 items ā¢ Updated Sep 18 ā¢ 44
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. ā¢ 5 items ā¢ Updated Oct 1 ā¢ 45
view article Article Multimodal Augmentation for Documents: Recovering āComprehensionā in āReading and Comprehensionā task By danaaubakirova ā¢ May 16 ā¢ 17
DonaciĆ³n Somos600M Collection ColecciĆ³n de los corpus donados para el Hackathon de SomosNLP 2024: #somos600M ā¢ 4 items ā¢ Updated Mar 9 ā¢ 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. ā¢ 43 items ā¢ Updated Apr 12 ā¢ 117
š¤ TinyLlama Alignment Collection TinyLlama-1.1B model aligned on Intel's Orca dataset. Comparison of DPO/IPO/KTO. ā¢ 3 items ā¢ Updated Mar 22 ā¢ 1
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws Paper ā¢ 2401.00448 ā¢ Published Dec 31, 2023 ā¢ 28