Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MoeReward
/
rl_checkpoints
like
0
Follow
Project of MoE reward model
7
Safetensors
Model card
Files
Files and versions
Community
main
rl_checkpoints
/
qwen1.5_base_rule_base_math_heavy_drgrpo_reward_func
Commit History
drgrpo checkpoints
2581e08
shengyi-qian
commited on
Apr 21