WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization Paper • 2507.15061 • Published 2 days ago • 31
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning Paper • 2507.12841 • Published 6 days ago • 37
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research Paper • 2507.13300 • Published 5 days ago • 16
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Paper • 2507.11527 • Published 7 days ago • 30
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published 15 days ago • 26
view article Article Open Source All About Data Processing, Dataverse By EujeongChoi • Apr 4, 2024 • 3
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • 27 days ago • 110
Thought Anchors: Which LLM Reasoning Steps Matter? Paper • 2506.19143 • Published 29 days ago • 11