PRM and fine-tuned LLM used in our PURE GitHub repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
liked a model about 2 months ago: stepfun-ai/step3-fp8
liked a model about 2 months ago: stepfun-ai/step3
upvoted a collection about 2 months ago: Step3
Organizations
None yet