Zikang Shan PRO
zkshan2002
·
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
about 13 hours ago
zktmp/r2a-cot-dq1.5-40g8-step260
published
a model
about 13 hours ago
zktmp/r2a-cot-dq1.5-40g8-step260
updated
a model
about 15 hours ago
zktmp/r2a-cot-dq1.5-40g8-step240