SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification Paper โข 2506.15569 โข Published 4 days ago โข 11
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper โข 2501.12380 โข Published Jan 21 โข 86