Yiwei Chen

YiweiChen

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

updated a model 29 days ago

YiweiChen/WMDP-NPO

published a model 29 days ago

YiweiChen/WMDP-NPO

Organizations

upvoted a paper about 7 hours ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published 4 days ago • 109

upvoted a paper 6 months ago

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning

Paper • 2504.00487 • Published Apr 1 • 18

upvoted a collection 11 months ago

SimNPO-Unlearned Models

This collection hosts the SimNPO-unlearned models over TOFU, MUSE, and WMDP unlearning benchmarks. • 7 items • Updated Aug 8 • 2