11 28 7

WeihaoZeng

AndrewZeng

https://github.com/Zeng-WH

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

OpenCUA: Open Foundations for Computer-Use Agents

upvoted a paper 22 days ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

upvoted a paper 26 days ago

Agentic Reinforced Policy Optimization

View all activity

Organizations

upvoted a paper 7 days ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published 12 days ago • 27

upvoted a paper 22 days ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published 24 days ago • 108

upvoted a paper 26 days ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 29 days ago • 141

upvoted a paper about 1 month ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

upvoted 5 papers 3 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 177

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published May 28 • 6

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 103

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published May 26 • 67

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 42

updated a dataset 3 months ago

AndrewZeng/hacking_deepscalaer

Viewer • Updated May 25 • 6.12k • 12

published a dataset 3 months ago

AndrewZeng/hacking_deepscalaer

Viewer • Updated May 25 • 6.12k • 12

upvoted 3 papers 3 months ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20 • 63

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 34

updated a dataset 4 months ago

AndrewZeng/SimpleRL-SFT

Viewer • Updated Apr 17 • 7.62k • 1

published a dataset 4 months ago

AndrewZeng/SimpleRL-SFT

Viewer • Updated Apr 17 • 7.62k • 1

upvoted a collection 4 months ago

SimpleRL

Collection

The collection for the Project "Simple Reinforcement Learning for Reasoning" • 2 items • Updated Feb 19 • 7

updated a dataset 5 months ago

AndrewZeng/math_level1to5_qwen_prompt

Viewer • Updated Apr 2 • 12k • 22 • 1

published a dataset 5 months ago

AndrewZeng/math_level1to5_qwen_prompt

Viewer • Updated Apr 2 • 12k • 22 • 1

upvoted a paper 5 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 46

WeihaoZeng

AI & ML interests

Recent Activity

Organizations

AndrewZeng's activity