view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ 26 days ago β’ 417
view article Article How to Build an MCP Server with Gradio By abidlabs and 1 other β’ Apr 30 β’ 162
view article Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c β’ Apr 25 β’ 267
view article Article Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC By freddyaboulton β’ Apr 9 β’ 26
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others β’ Mar 12 β’ 426
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others β’ Mar 4 β’ 74
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other β’ Feb 25 β’ 162
view article Article From Llasa to Llasagna π: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other β’ Feb 11 β’ 29
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.25k
view article Article The AI tools for Art Newsletter - Issue 1 By linoyts and 1 other β’ Jan 31 β’ 79
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 862
view article Article Welcome to Inference Providers on the Hub π₯ By julien-c and 6 others β’ Jan 28 β’ 483
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 15 items β’ Updated Apr 18 β’ 231
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper β’ 2402.13753 β’ Published Feb 21, 2024 β’ 117
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper β’ 2311.06772 β’ Published Nov 12, 2023 β’ 35