Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 2 items • Updated 1 day ago • 28
Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany • about 15 hours ago • 40
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 1 day ago • 50
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper • 2505.24760 • Published 5 days ago • 52
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 1 day ago • 102
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • 1 day ago • 42
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published 4 days ago • 75
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published 4 days ago • 105
Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published 4 days ago • 70
Red Hat AI validated models - v1.0 Collection v1.0 Collection of third-party generative AI models validated by Red Hat AI for use across the Red Hat AI Product Portfolio. • 39 items • Updated 7 days ago • 3
Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • May 28, 2024 • 37
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs Paper • 2505.21327 • Published 8 days ago • 81
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Paper • 2505.19897 • Published 9 days ago • 100
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation Paper • 2505.20292 • Published 8 days ago • 52
Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻 By sasha • 7 days ago • 18
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Paper • 2505.18125 • Published 11 days ago • 110
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published 9 days ago • 61
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published 10 days ago • 142
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper • 2505.17894 • Published 12 days ago • 212