view article Article Featherless AI on Hugging Face Inference Providers π₯ By sbrandeis and 5 others β’ Jun 12 β’ 45
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ May 12 β’ 487
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning β’ 7 items β’ Updated 2 days ago β’ 52
view article Article State of open video generation models in Diffusers By sayakpaul and 2 others β’ Jan 27 β’ 56
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others β’ Mar 12 β’ 447
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others β’ Feb 21 β’ 174
view article Article Welcome to Inference Providers on the Hub π₯ By julien-c and 6 others β’ Jan 28 β’ 484
view article Article Deploying Speech-to-Speech on Hugging Face By andito and 3 others β’ Oct 22, 2024 β’ 40
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy By medmekk and 5 others β’ Sep 18, 2024 β’ 261
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other β’ Oct 14, 2024 β’ 95
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 β’ 15 items β’ Updated Dec 6, 2024 β’ 625
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper β’ 2408.02545 β’ Published Aug 5, 2024 β’ 39
ReLiK: Retrieve, Read and LinK Collection A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. β’ 20 items β’ Updated Dec 4, 2024 β’ 25
view article Article Docmatix - a huge dataset for Document Visual Question Answering By andito and 1 other β’ Jul 18, 2024 β’ 73
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated May 5 β’ 231