Article SigLIP 2: A better multilingual vision language encoder about 18 hours ago • 47
Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 3 days ago • 49
Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 4 days ago • 86
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 4 days ago • 61
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 10 days ago • 48
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 4 days ago • 40
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 1 day ago • 238
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 17 days ago • 187
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 17 days ago • 55
Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 36
Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 25 days ago • 18
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 28 days ago • 30
Article Mini-R1: Reproduce Deepseek R1 “aha moment” a RL tutorial By open-r1 • 21 days ago • 35
Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 28 days ago • 12
Article Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 • 24 days ago • 15
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 26 days ago • 359