Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published 26 days ago • 70
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published 20 days ago • 54
ARC-AGI-2: A New Challenge for Frontier AI Reasoning Systems Paper • 2505.11831 • Published May 17 • 1
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
AI4Research: A Survey of Artificial Intelligence for Scientific Research Paper • 2507.01903 • Published 25 days ago • 4
Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact Paper • 2507.00951 • Published 26 days ago • 22
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications Paper • 2506.12594 • Published Jun 14 • 1
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13 • 66
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published May 30 • 95
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Paper • 2505.10468 • Published May 15 • 9
LLMs Will Always Hallucinate, and We Need to Live With This Paper • 2409.05746 • Published Sep 9, 2024 • 5
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models Paper • 2505.10543 • Published May 15 • 1
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback Paper • 2305.14975 • Published May 24, 2023 • 2