Xiaohan Fu's picture

2 2

Xiaohan Fu

x5fu

·

https://xhfu.me

AI & ML interests

Security and Safety

Recent Activity

upvoted a paper 12 days ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

authored a paper about 1 month ago

Training Language Models to Generate Quality Code with Program Analysis Feedback

upvoted a paper about 1 month ago

Training Language Models to Generate Quality Code with Program Analysis Feedback

View all activity

Organizations

None yet

upvoted a paper 12 days ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 13 days ago • 43

upvoted a paper about 1 month ago

Training Language Models to Generate Quality Code with Program Analysis Feedback

Paper • 2505.22704 • Published May 28 • 11