Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • Feb 17 • 22
Article Train your ControlNet with diffusers By multimodalart and 1 other • Mar 24, 2023 • 32
Article Drag GAN - Interactive Point-based Manipulation on the Generative Image Manifold By hwaseem04 • Dec 17, 2023 • 2
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 53
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition Paper • 2407.13559 • Published Jul 18, 2024 • 17
Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach Paper • 2406.00409 • Published Jun 1, 2024 • 1
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models By andito and 2 others • Jun 24, 2024 • 194
Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23, 2024 • 90
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition Paper • 2406.09630 • Published Jun 13, 2024 • 2