Zikang Shan PRO

zkshan2002

https://zkshan2002.github.io/

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model about 1 hour ago

zktmp/f11-dq2e-step109

published a model about 21 hours ago

zktmp/f11-dq2e-step109

updated a model 1 day ago

zktmp/fd7-dq2e-step109

zkshan2002's models (6)

zkshan2002/rm-lsft0

8B • Updated 13 days ago • 17

zkshan2002/rm-lsft

8B • Updated 18 days ago • 116

zkshan2002/rm-q3

8B • Updated 19 days ago • 75

zkshan2002/dl2025_adaptive

8B • Updated Jun 25 • 10

zkshan2002/dl2025_filter

8B • Updated Jun 21 • 8

zkshan2002/dl2025_grpo

8B • Updated Jun 19 • 9