Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
polaris-73
/
ds1p5b_grpo_math_gsm8k_ppo-global_step_870
like
0
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
ds1p5b_grpo_math_gsm8k_ppo-global_step_870
/
tokenizer_config.json
Commit History
Upload ds1p5b_grpo_math_gsm8k_ppo at global_step_870
5d60356
verified
polaris-73
commited on
Jul 14