Tim Dingman

tdingman-scale

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 43

upvoted 12 papers 12 days ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published 14 days ago • 49

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

Paper • 2505.24098 • Published May 30 • 44

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 15 days ago • 56

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published 26 days ago • 51

Magistral

Paper • 2506.10910 • Published 26 days ago • 61

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 64

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 73

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 132

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 166

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 258

Reinforcement Pre-Training

Paper • 2506.08007 • Published 29 days ago • 241

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 22 days ago • 250

upvoted a paper over 1 year ago

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Paper • 2311.02262 • Published Nov 3, 2023 • 15

upvoted a paper about 2 years ago

Preference Ranking Optimization for Human Alignment

Paper • 2306.17492 • Published Jun 30, 2023 • 6