DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 3 days ago • 71
ViExam: Are Vision Language Models Better than Humans on Vietnamese Multimodal Exam Questions? Paper • 2508.13680 • Published 4 days ago • 5
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published 3 days ago • 24
From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery Paper • 2508.14111 • Published 5 days ago • 25
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction Paper • 2508.11987 • Published 7 days ago • 54
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published 17 days ago • 100
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Paper • 2508.13009 • Published 5 days ago • 21
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds Paper • 2508.12782 • Published 5 days ago • 23
Train Long, Think Short: Curriculum Learning for Efficient Reasoning Paper • 2508.08940 • Published 11 days ago • 22
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published 12 days ago • 45
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published 16 days ago • 116
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 12 days ago • 38
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published 13 days ago • 82
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 15 days ago • 155
ChartCap: Mitigating Hallucination of Dense Chart Captioning Paper • 2508.03164 • Published 18 days ago • 5
Tool-integrated Reinforcement Learning for Repo Deep Search Paper • 2508.03012 • Published 18 days ago • 18