Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper β’ 2502.09619 β’ Published Feb 13 β’ 36
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper β’ 2505.22453 β’ Published 9 days ago β’ 45
view article Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others β’ Dec 31, 2024 β’ 1.06k
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published Feb 4 β’ 232
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.25k
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 180
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model By merve and 2 others β’ May 14, 2024 β’ 253
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x β’ Jun 23, 2024 β’ 34
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated Apr 17, 2024 β’ 56
view article Article Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks By lmassaron β’ Feb 21, 2024 β’ 16
view article Article Design choices for Vision Language Models in 2024 By gigant β’ Apr 16, 2024 β’ 28
view article Article Mixture of Experts Explained By osanseviero and 5 others β’ Dec 11, 2023 β’ 666
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others β’ Jan 18, 2024 β’ 63
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others β’ May 24, 2023 β’ 151
Foundation Models for Vision π§© Collection Foundation models for computer vision. β’ 24 items β’ Updated Mar 11, 2024 β’ 20