MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published 13 days ago • 24
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework Paper • 2412.11713 • Published Dec 16, 2024 • 6
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations Paper • 2410.22821 • Published Oct 30, 2024 • 2
Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach Paper • 2410.06949 • Published Oct 9, 2024 • 6
Real-time Holistic Robot Pose Estimation with Unknown States Paper • 2402.05655 • Published Feb 8, 2024
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories Paper • 2405.19856 • Published May 30, 2024 • 9