view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany β’ about 15 hours ago β’ 40
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper β’ 2506.01939 β’ Published 1 day ago β’ 102
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper β’ 2505.21600 β’ Published 7 days ago β’ 67
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper β’ 2505.24864 β’ Published 4 days ago β’ 105
view article Article System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience By codelion β’ 2 days ago β’ 9
TradExpert: Revolutionizing Trading with Mixture of Expert LLMs Paper β’ 2411.00782 β’ Published Oct 16, 2024 β’ 2
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper β’ 2505.18445 β’ Published 11 days ago β’ 63
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper β’ 2505.17894 β’ Published 12 days ago β’ 212
MMaDA: Multimodal Large Diffusion Language Models Paper β’ 2505.15809 β’ Published 13 days ago β’ 85
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper β’ 2505.16938 β’ Published 12 days ago β’ 115
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Paper β’ 2505.18125 β’ Published 11 days ago β’ 110
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. β’ 4 items β’ Updated 5 days ago β’ 145
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ 12 days ago β’ 117
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning Paper β’ 2505.14231 β’ Published 15 days ago β’ 51