36 80 73

Zengzhi Wang

SinclairWang

https://tinyurl.com/zengzhi-homepage

AI & ML interests

Data Engineering for Generative AI

Recent Activity

authored a paper about 15 hours ago

MegaMath: Pushing the Limits of Open Math Corpora

authored a paper about 15 hours ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

new activity about 20 hours ago

OctoThinker/MegaMath-Web-Pro-Max:Still uploading, please stay tuned.

View all activity

Organizations

upvoted a paper 1 day ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published 2 days ago • 30

upvoted a paper 7 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 10 days ago • 42

upvoted a paper about 1 month ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 33

upvoted a collection 2 months ago

DeepSeek-R1

Collection

10 items • Updated 29 days ago • 734

upvoted a paper 2 months ago

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published Apr 15 • 11

upvoted an article 3 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

and 6 others •

Feb 20

• 276

upvoted 4 papers 3 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 31

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published Apr 3 • 30

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published Mar 16 • 35

upvoted an article 4 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 293

upvoted a collection 4 months ago

ProX Dataset

Collection

a collection of pre-training corpora refined by ProX • 6 items • Updated Feb 14 • 7

upvoted a paper 4 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 104

upvoted a paper 5 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

upvoted a collection 5 months ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 269

upvoted 2 papers 6 months ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 15

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 66

upvoted a paper 7 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

upvoted an article 8 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 101

upvoted a paper 8 months ago

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 19

Zengzhi Wang

AI & ML interests

Recent Activity

Organizations

SinclairWang's activity

SmolVLM2: Bringing Video Understanding to Every Device

Open R1: Update #3

Releasing the largest multilingual open pretraining dataset