polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_400
Safetensors
qwen2
main
model-00001-of-00002.safetensors
Commit History
Upload ds1p5b_grpo_math_gsm8k_ppo at global_step_400
8769bd7
verified
polaris-73 committed on Jul 14