Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zikang Shan's picture

Zikang Shan PRO

zkshan2002
·
https://zkshan2002.github.io/
  • zkshan2002

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model about 1 hour ago
zktmp/f11-dq2e-step109
published a model about 21 hours ago
zktmp/f11-dq2e-step109
updated a model 1 day ago
zktmp/fd7-dq2e-step109
View all activity

Organizations

Reinforced Token Optimization's profile picture zktmp's profile picture

zkshan2002 's models 6

zkshan2002/rm-lsft0

8B • Updated 13 days ago • 17

zkshan2002/rm-lsft

8B • Updated 18 days ago • 116

zkshan2002/rm-q3

8B • Updated 19 days ago • 75

zkshan2002/dl2025_adaptive

8B • Updated Jun 25 • 10

zkshan2002/dl2025_filter

8B • Updated Jun 21 • 8

zkshan2002/dl2025_grpo

8B • Updated Jun 19 • 9
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs