view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others β’ Nov 26, 2024 β’ 315
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr β’ Feb 11 β’ 41
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models By andito and 2 others β’ Jun 24, 2024 β’ 194
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model By VictorSanh and 10 others β’ Aug 22, 2023 β’ 33
view article Article Unlocking Longer Generation with Key-Value Cache Quantization By RaushanTurganbay β’ May 16, 2024 β’ 49
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 867
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 287
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques π π By Isayoften β’ Aug 26, 2024 β’ 66
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29, 2024 β’ 331
view article Article π€ PEFT welcomes new merging methods By smangrul and 1 other β’ Feb 19, 2024 β’ 19
view article Article Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning By Andyrasika β’ Jan 19, 2024 β’ 17
view article Article π€ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other β’ Feb 10, 2023 β’ 86
view article Article RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples By airabbitX β’ Aug 16, 2024 β’ 7