Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper โข 2505.14810 โข Published 17 days ago โข 60
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper โข 2505.16938 โข Published 15 days ago โข 115