Gunnar's picture

18 4

Gunnar

grhone

·

grhone

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

OpenThoughts: Data Recipes for Reasoning Models

upvoted a paper 8 days ago

CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

upvoted a paper 8 days ago

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

View all activity

Organizations

None yet

grhone's activity

upvoted a paper 5 days ago

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published 6 days ago • 38

upvoted 2 papers 8 days ago

CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

Paper • 2504.04753 • Published Apr 7 • 1

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

Paper • 2505.19713 • Published 16 days ago • 1

upvoted 2 papers 10 days ago

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published 13 days ago • 29

ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting

Paper • 2503.22346 • Published Mar 28 • 2

upvoted a paper 11 days ago

Synthetic Data RL: Task Definition Is All You Need

Paper • 2505.17063 • Published 24 days ago • 10

upvoted 12 papers about 1 month ago

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 145

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 105

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 57

Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks

Paper • 2503.13988 • Published Mar 18 • 1

Multi-Agent System for Comprehensive Soccer Understanding

Paper • 2505.03735 • Published May 6 • 22

Geospatial Mechanistic Interpretability of Large Language Models

Paper • 2505.03368 • Published May 6 • 9

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 23

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 93

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 170

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 36

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Paper • 2505.02835 • Published May 5 • 26

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 76