ClawBench Leaderboard
Can AI agents complete everyday online tasks?
Natural Language Processing, Image Generation
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time