SmolVLM2 Smallest video LM ever Collection 11 items • Updated about 1 month ago • 65
RWKV: Reinventing RNNs for the Transformer Era Paper • 2305.13048 • Published May 22, 2023 • 19
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms Paper • 2410.23144 • Published Oct 30, 2024 • 4
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 251
Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Feb 20 • 51
SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Feb 20 • 220
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper • 2311.05556 • Published Nov 9, 2023 • 87
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24, 2024 • 60
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance Paper • 2307.00522 • Published Jul 2, 2023 • 32
Learning and Leveraging World Models in Visual Representation Learning Paper • 2403.00504 • Published Mar 1, 2024 • 33
Probing the 3D Awareness of Visual Foundation Models Paper • 2404.08636 • Published Apr 12, 2024 • 13
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations Paper • 2407.03471 • Published Jul 3, 2024 • 31