Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

authored a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

authored a paper 11 days ago

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

chujiezheng's activity

commented a paper 18 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 181

commented a paper 3 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

commented 2 papers 5 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

commented 4 papers 6 months ago

New activity in chujiezheng/Mistral7B-PairRM-SPPO-ExPO 9 months ago

Adding Evaluation Results

#1 opened 9 months ago by leaderboard-pr-bot

New activity in mistralai/Mistral-7B-Instruct-v0.3 about 1 year ago

no system message?

#14 opened about 1 year ago by mclassHF2023

commented a paper about 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

New activity in chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO about 1 year ago

Possibly wrong model

#1 opened about 1 year ago by ByteBrew23

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 year ago

Update README.md

#3 opened about 1 year ago by chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO about 1 year ago

Update README.md

#2 opened about 1 year ago by chujiezheng

New activity in chujiezheng/Llama3-70B-Chinese-Chat-ExPO about 1 year ago

Create README.md

#1 opened about 1 year ago by chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 year ago

Update README.md

#2 opened about 1 year ago by chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO about 1 year ago

Create README.md

#1 opened about 1 year ago by chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 year ago

Create README.md

#1 opened about 1 year ago by chujiezheng

New activity in chujiezheng/LLaMA3-iterative-DPO-final-ExPO about 1 year ago

Create README.md

#1 opened about 1 year ago by chujiezheng

New activity in chujiezheng/tulu-2-dpo-13b about 1 year ago

Update tokenizer_config.json

#2 opened about 1 year ago by chujiezheng