Jarrod Barnes PRO

Jarrodbarnes

jbarnes850

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

Salesforce/CRMArenaPro

upvoted a paper 4 days ago

SSRL: Self-Search Reinforcement Learning

liked a model 6 days ago

Arc-Intelligence/arc-teacher-8b

View all activity

Organizations

upvoted a paper 4 days ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 8 days ago • 86

upvoted a paper 10 days ago

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Paper • 2507.23751 • Published 22 days ago • 4

upvoted a paper 17 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published 18 days ago • 210

upvoted a paper 18 days ago

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9 • 6

upvoted an article 23 days ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

and 4 others •

25 days ago

• 158

upvoted a paper 24 days ago

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Paper • 2504.14286 • Published Apr 19 • 1

upvoted a paper 29 days ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published about 1 month ago • 36

upvoted an article about 1 month ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

Jul 18

• 47

upvoted 3 papers about 1 month ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11 • 30

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 85

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 280

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 635

upvoted a paper about 2 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29 • 61

upvoted a collection about 2 months ago

VisionLM

Collection

1414 items • Updated 1 day ago • 101

upvoted 2 papers about 2 months ago

Listener-Rewarded Thinking in VLMs for Image Preferences

Paper • 2506.22832 • Published Jun 28 • 23

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28 • 12

upvoted a collection about 2 months ago

QVQ

Collection

QVQ: Qwen models for visual reasoning • 7 items • Updated Jul 21 • 52

upvoted a paper about 2 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 270

upvoted an article 3 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 184

upvoted a paper 3 months ago

TRAIL: Trace Reasoning and Agentic Issue Localization

Paper • 2505.08638 • Published May 13 • 6

Jarrod Barnes PRO

AI & ML interests

Recent Activity

Organizations

Jarrodbarnes's activity

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

SmolLM3: smol, multilingual, long-context reasoner

Let's talk about LLM evaluation