khazzz1c's picture

1 2 14

khazzz1c

khazic

·

https://github.com/khazic

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper 16 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

View all activity

Organizations

upvoted a paper 16 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 17 days ago • 156