xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations Paper • 2506.13651 • Published 9 days ago
CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding Paper • 2405.02384 • Published May 3, 2024
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective Paper • 2502.17262 • Published Feb 24