zswzswzsw/grpo_run_code
arxiv: 2310.16944, 2203.02155, 2307.09288
main
1 contributor, History: 3 commits
Latest commit: zswzswzsw, "Upload folder using huggingface_hub", 2a4552a (verified), 14 days ago
Name                      Size       Scan  Last commit                          Updated
.github                   -          -     Upload folder using huggingface_hub  14 days ago
assets                    -          -     Upload folder using huggingface_hub  14 days ago
chapters                  -          -     Upload folder using huggingface_hub  14 days ago
recipes                   -          -     Upload folder using huggingface_hub  14 days ago
scripts                   -          -     Upload folder using huggingface_hub  14 days ago
src                       -          -     Upload folder using huggingface_hub  14 days ago
tests                     -          -     Upload folder using huggingface_hub  14 days ago
trl_012_grpo              -          -     Upload folder using huggingface_hub  14 days ago
.gitattributes            1.52 kB    Safe  initial commit                       14 days ago
.gitignore                3.11 kB    -     Upload folder using huggingface_hub  14 days ago
CITATION.cff              738 Bytes  -     Upload folder using huggingface_hub  14 days ago
LICENSE                   11.4 kB    Safe  Upload folder using huggingface_hub  14 days ago
Makefile                  1.03 kB    -     Upload folder using huggingface_hub  14 days ago
README.md                 8.28 kB    -     Upload folder using huggingface_hub  14 days ago
config_dpo_run.yaml       2.05 kB    -     Upload folder using huggingface_hub  14 days ago
config_grpo_offline.yaml  2.17 kB    -     Upload folder using huggingface_hub  14 days ago
config_sft_test_env.yaml  2.02 kB    -     Upload folder using huggingface_hub  14 days ago
grpo_max_completion.py    9.37 kB    -     Upload folder using huggingface_hub  14 days ago
grpo_offline_run.py       8.5 kB     -     Upload folder using huggingface_hub  14 days ago
run_dpo.py                10.3 kB    -     Upload folder using huggingface_hub  14 days ago
run_sft_test_env.py       7.86 kB    -     Upload folder using huggingface_hub  14 days ago
setup.cfg                 698 Bytes  -     Upload folder using huggingface_hub  14 days ago
setup.py                  4.9 kB     -     Upload folder using huggingface_hub  14 days ago
test.json                 269 kB     -     Upload folder using huggingface_hub  14 days ago