view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita π₯ 4 days ago β’ 86
view article Article Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios By pratikbhavsar and 1 other β’ 9 days ago β’ 12
view article Article π#88: Can DeepSeek Inspire Global Collaboration? By Kseniase β’ 4 days ago β’ 3
view article Article Topic 27: What are Chain-of-Agents and Chain-of-RAG? By Kseniase and 1 other β’ 8 days ago β’ 10
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! β’ 9 items β’ Updated 4 days ago β’ 61
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published 17 days ago β’ 187
Jan 17 Releases βοΈ Collection Models and datasets of the second week of Jan 2025. β’ 23 items β’ Updated Jan 17 β’ 11
GGUF LoRA adapters Collection Adapters extracted from fine tuned models, using mergekit-extract-lora β’ 16 items β’ Updated 29 days ago β’ 3
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 16 items β’ Updated 1 day ago β’ 238
view article Article Decoding Strategies in Large Language Models By mlabonne β’ Oct 29, 2024 β’ 43
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18, 2024 β’ 227