Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper β’ 2506.09250 β’ Published 17 days ago β’ 27
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper β’ 2506.05209 β’ Published 22 days ago β’ 42
One-RL-to-See-Them-All Collection One RL to See Them All: Visual Triple Unified Reinforcement Learning. GitHub: https://github.com/MiniMax-AI/One-RL-to-See-Them-All β’ 5 items β’ Updated 17 days ago β’ 27
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper β’ 2505.17612 β’ Published May 23 β’ 78
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper β’ 2505.22954 β’ Published 30 days ago β’ 11
view article Article π Introducing **Moon**: Storytelling Generator Model By kulia-moon and 1 other β’ 28 days ago β’ 6
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published May 27 β’ 61
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper β’ 2505.19297 β’ Published May 25 β’ 78
view article Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks π±π§πΌβπ» By sasha β’ about 1 month ago β’ 21
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ May 23 β’ 137
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ May 26 β’ 44
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 87