CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images Paper • 2504.04753 • Published Apr 7 • 1
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward Paper • 2505.19713 • Published 16 days ago • 1
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper • 2505.22914 • Published 13 days ago • 29
ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting Paper • 2503.22346 • Published Mar 28 • 2
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 105
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published Feb 12 • 57
Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks Paper • 2503.13988 • Published Mar 18 • 1
Geospatial Mechanistic Interpretability of Large Language Models Paper • 2505.03368 • Published May 6 • 9
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published May 6 • 93
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 170
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28 • 36
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper • 2505.02835 • Published May 5 • 26