Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 1 day ago • 58
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published 11 days ago • 61
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 19 days ago • 43
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 384
👩‍💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated Mar 11 • 16
Article LeRobot goes to driving school: World's largest open-source self-driving dataset Mar 11 • 73
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 117
AI Engineering Collection A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book. • 239 items • Updated 15 days ago • 16
Article Hugging Face and JFrog partner to make AI Security more transparent Mar 4 • 21