zswzswzsw/grpo_run_code
arxiv: 2310.16944, 2203.02155, 2307.09288
Branch: main
Path: grpo_run_code/scripts
1 contributor, History: 1 commit
Latest commit: ae40651 (verified) by zswzswzsw, "Upload folder using huggingface_hub", 14 days ago
File          Size      Last commit                            Updated
README.md     9.66 kB   Upload folder using huggingface_hub    14 days ago
run_cpt.py    7.24 kB   Upload folder using huggingface_hub    14 days ago
run_dpo.py    9.5 kB    Upload folder using huggingface_hub    14 days ago
run_orpo.py   9.8 kB    Upload folder using huggingface_hub    14 days ago
run_sft.py    8.32 kB   Upload folder using huggingface_hub    14 days ago