view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • Feb 17 • 25
view article Article Train your ControlNet with diffusers By multimodalart and 1 other • Mar 24, 2023 • 34
view article Article Drag GAN - Interactive Point-based Manipulation on the Generative Image Manifold By hwaseem04 • Dec 17, 2023 • 3
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 54
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition Paper • 2407.13559 • Published Jul 18, 2024 • 18
Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach Paper • 2406.00409 • Published Jun 1, 2024 • 1
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models By andito and 2 others • Jun 24, 2024 • 200
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23, 2024 • 95
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition Paper • 2406.09630 • Published Jun 13, 2024 • 2