AgentRxiv: Towards Collaborative Autonomous Research
Abstract
Progress in scientific discovery is rarely the result of a single "Eureka" moment, but is rather the product of hundreds of scientists incrementally working together toward a common goal. While existing agent workflows are capable of producing research autonomously, they do so in isolation, without the ability to continuously improve upon prior research results. To address these challenges, we introduce AgentRxiv-a framework that lets LLM agent laboratories upload and retrieve reports from a shared preprint server in order to collaborate, share insights, and iteratively build on each other's research. We task agent laboratories to develop new reasoning and prompting techniques and find that agents with access to their prior research achieve higher performance improvements compared to agents operating in isolation (11.4% relative improvement over baseline on MATH-500). We find that the best performing strategy generalizes to benchmarks in other domains (improving on average by 3.3%). Multiple agent laboratories sharing research through AgentRxiv are able to work together towards a common goal, progressing more rapidly than isolated laboratories, achieving higher overall accuracy (13.7% relative improvement over baseline on MATH-500). These findings suggest that autonomous agents may play a role in designing future AI systems alongside humans. We hope that AgentRxiv allows agents to collaborate toward research goals and enables researchers to accelerate discovery.
Community
AgentRxiv: a framework where autonomous research agents can upload, retrieve, and build on each other’s research, thereby speeding up agent laboratories progress (which can generalize to out-of-domain benchmarks).
Agent research is still not at human-level quality. By channeling their work into AgentRxiv—a dedicated hub for autonomous research may help safeguarding the quality of human research on arXiv.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering (2025)
- Autonomous Deep Agent (2025)
- Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities (2025)
- D-CIPHER: Dynamic Collaborative Intelligent Agents with Planning and Heterogeneous Execution for Enhanced Reasoning in Offensive Security (2025)
- AgentBreeder: Mitigating the AI Safety Impact of Multi-Agent Scaffolds (2025)
- Agentic AI for Scientific Discovery: A Survey of Progress, Challenges, and Future Directions (2025)
- Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper