Presumed Cultural Identity: How Names Shape LLM Responses Paper β’ 2502.11995 β’ Published 4 days ago β’ 9
Demystifying Long Chain-of-Thought Reasoning in LLMs Paper β’ 2502.03373 β’ Published 16 days ago β’ 51
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published 17 days ago β’ 187
view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 18 days ago β’ 106
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated 9 days ago β’ 90
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI β’ Jan 15 β’ 41
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper β’ 2406.11896 β’ Published Jun 14, 2024 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 40
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated Jan 8 β’ 559
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper β’ 2412.04454 β’ Published Dec 5, 2024 β’ 63
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 8 days ago β’ 80