SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines • Paper • 2509.21320 • Published 3 days ago • 86
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis • Paper • 2509.10441 • Published 16 days ago • 30
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers • Paper • 2508.21148 • Published about 1 month ago • 141
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models • Paper • 2508.00819 • Published Aug 1 • 62
OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data • Paper • 2505.23522 • Published May 29 • 2
EarthSE: A Benchmark for Evaluating Earth Scientific Exploration Capability of LLMs • Paper • 2505.17139 • Published May 22 • 2
PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models • Paper • 2506.17667 • Published Jun 21 • 3
Manalyzer: End-to-end Automated Meta-analysis with Multi-agent System • Paper • 2505.20310 • Published May 22 • 1
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning • Paper • 2506.10521 • Published Jun 12 • 73