Zikang Shan's picture

Zikang Shan PRO

zkshan2002
·

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model about 13 hours ago
zktmp/r2a-cot-dq1.5-40g8-step260
published a model about 13 hours ago
zktmp/r2a-cot-dq1.5-40g8-step260
updated a model about 15 hours ago
zktmp/r2a-cot-dq1.5-40g8-step240
View all activity

Organizations

Reinforced Token Optimization's profile picture zktmp's profile picture