MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research • Paper • 2505.19955 • Published 16 days ago • 10
HardTests: Synthesizing High-Quality Test Cases for LLM Coding • Paper • 2505.24098 • Published 12 days ago • 42
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination • Paper • 2505.21925 • Published 14 days ago • 35