35 12 1

Changyu Chen PRO

Cameron-Chen

AI & ML interests

Generative Models, LLMs, Reinforcement Learning.

Recent Activity

upvoted a paper 12 days ago

Fostering Video Reasoning via Next-Event Prediction

upvoted a paper 13 days ago

Reinforcing General Reasoning without Verifiers

upvoted a paper 14 days ago

Lifelong Safety Alignment for Language Models

View all activity

Organizations

Cameron-Chen's activity

upvoted a paper 12 days ago

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published 13 days ago • 27

upvoted a paper 13 days ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published 13 days ago • 26

upvoted a paper 14 days ago

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published 14 days ago • 23

upvoted a paper 20 days ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published 21 days ago • 35

upvoted a paper about 2 months ago

Efficient Process Reward Model Training via Active Learning

Paper • 2504.10559 • Published Apr 14 • 13

upvoted a paper 2 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 50

upvoted a collection 6 months ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 6 days ago • 28

upvoted a paper 7 months ago

Sample-Efficient Alignment for LLMs

Paper • 2411.01493 • Published Nov 3, 2024 • 12

upvoted 2 collections 11 months ago

Gemma 2 Release

Collection

15 items • Updated 11 days ago • 219

💡 DICE

Collection

Self-alignment with DPO Implicit Rewards • 5 items • Updated Jul 28, 2024 • 9

upvoted a paper 11 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 39

upvoted a paper 12 months ago

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14, 2024 • 41