Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models • Paper • 2512.21337 • Published 12 days ago • 28
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness • Paper • 2512.15374 • Published 19 days ago • 5