1 13 6

Jingming Zhuo

JingmingZ

AI & ML interests

Large Language Models

Recent Activity

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a collection 2 months ago

DR Tulu

View all activity

Organizations

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

upvoted a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

upvoted a collection 2 months ago

DR Tulu

Collection

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated Nov 25, 2025 • 31

liked a dataset 2 months ago

rl-research/dr-tulu-sft-data

Viewer • Updated Nov 25, 2025 • 13.1k • 289 • 25

upvoted a paper 4 months ago

Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization

Paper • 2509.23371 • Published Sep 27, 2025 • 6

updated a dataset 4 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28, 2025 • 500 • 3

published a dataset 4 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28, 2025 • 500 • 3

upvoted a paper 5 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 79

updated a dataset 5 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31, 2025 • 9.88k • 9

published a dataset 5 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31, 2025 • 9.88k • 9

updated a dataset 5 months ago

rl-rag/bc_synthetic_v_2

Viewer • Updated Aug 30, 2025 • 3.99k • 3

published a dataset 5 months ago

rl-rag/bc_synthetic_v_2

Viewer • Updated Aug 30, 2025 • 3.99k • 3

upvoted a paper 5 months ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published Aug 16, 2025 • 71

upvoted a paper 6 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22, 2025 • 63

upvoted a paper 8 months ago

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Paper • 2506.05331 • Published Jun 5, 2025 • 13

liked a dataset 8 months ago

xy06/MINT-CoT-Dataset

Viewer • Updated Jun 10, 2025 • 100 • 76 • 7

liked a model 8 months ago

xy06/MINT-CoT-7B

8B • Updated Jun 4, 2025 • 198 • 7

upvoted a paper 10 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3, 2025 • 68

upvoted a paper 11 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 213

liked a Space about 1 year ago

Open LMM Reasoning Leaderboard

🥇

A Leaderboard that demonstrates LMM reasoning capabilities

Jingming Zhuo

AI & ML interests

Recent Activity

Organizations

JingmingZ's activity

Open LMM Reasoning Leaderboard