61 32 114

Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

authored a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

authored a paper 11 days ago

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

View all activity

Organizations

chujiezheng's activity

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 3 days ago • 123

upvoted a paper 14 days ago

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

Paper • 2505.13529 • Published 19 days ago • 11

upvoted 2 papers 18 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 181

Prompt-Driven LLM Safeguarding via Directed Representation Optimization

Paper • 2401.18018 • Published Jan 31, 2024 • 1

upvoted a paper 21 days ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published 21 days ago • 33

upvoted a collection about 1 month ago

Qwen3

Collection

40 items • Updated 16 days ago • 737

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

upvoted 3 papers 5 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

upvoted a collection 5 months ago

QVQ

Collection

QVQ: Qwen models for visual reasoning • 7 items • Updated Apr 28 • 50

upvoted 4 papers 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 51

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 29

upvoted a collection 6 months ago

QwQ

Collection

Qwen with Questions • 6 items • Updated Apr 28 • 95

upvoted an article 7 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

•

Oct 24, 2024

• 12