Ryan Marten's picture

Ryan Marten

ryanmarten

·

https://ryanmarten.com

AI & ML interests

None yet

Recent Activity

new activity about 3 hours ago

harborframework/parity-experiments:SpreadsheetBench adapter parity (claude-code + Haiku 4.5, 400 tasks × 3 trials)

new activity 6 days ago

harborframework/terminal-bench-2.0:Define 'harbor' as eval framework 🎉

updated a dataset 7 days ago

harborframework/terminal-bench-2.0

View all activity

Organizations

upvoted a paper 9 months ago

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 52

upvoted a paper 10 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 54

upvoted 3 collections 10 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.68k

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 19 days ago • 46

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated 19 days ago • 18

upvoted an article 11 months ago

Article

Reasoning Datasets Competition

Apr 9, 2025

•

38

upvoted 2 collections 11 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 695

WildChat-50m

All model responses associated with the WildChat-50m paper. • 55 items • Updated Jan 29, 2025 • 9

upvoted 2 collections about 1 year ago

Whisper Release

Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 150

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues. • 4 items • Updated Mar 8, 2025 • 9

upvoted 2 articles about 1 year ago

Article

Open R1: Update #2

Feb 10, 2025

•

218

Article

Open-R1: Update #1

Feb 2, 2025

•

305

upvoted 2 collections about 1 year ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 183

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 834

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28, 2025

•

888

upvoted 5 collections over 1 year ago

MAmmoTH2

Scaling up instruction data from the web for to build better LLMs • 13 items • Updated Dec 9, 2024 • 12

DCLM

DCLM Models + Datasets • 6 items • Updated Aug 25, 2025 • 27

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 14 items • Updated Oct 22, 2025 • 65

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 708

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 44