zswzswzsw/grpo_run_code
arxiv: 2310.16944, 2203.02155, 2307.09288
Branch: main
Path: grpo_run_code/scripts
1 contributor, History: 1 commit
Latest commit: ae40651 (verified) by zswzswzsw, 3 months ago: Upload folder using huggingface_hub
File          Scan   Size      Last commit                           Updated
README.md     Safe   9.66 kB   Upload folder using huggingface_hub   3 months ago
run_cpt.py    Safe   7.24 kB   Upload folder using huggingface_hub   3 months ago
run_dpo.py    Safe   9.5 kB    Upload folder using huggingface_hub   3 months ago
run_orpo.py   Safe   9.8 kB    Upload folder using huggingface_hub   3 months ago
run_sft.py    Safe   8.32 kB   Upload folder using huggingface_hub   3 months ago