GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 20 days ago • 165
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods Paper • 2412.05579 • Published Dec 7, 2024 • 2
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions Paper • 2503.00501 • Published Mar 1 • 12