Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zswzswzsw
/
grpo_run_code
like
0
arxiv:
2310.16944
arxiv:
2203.02155
arxiv:
2307.09288
Model card
Files
Files and versions
Community
main
grpo_run_code
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
zswzswzsw
Upload folder using huggingface_hub
2a4552a
verified
3 months ago
.github
Upload folder using huggingface_hub
3 months ago
assets
Upload folder using huggingface_hub
3 months ago
chapters
Upload folder using huggingface_hub
3 months ago
recipes
Upload folder using huggingface_hub
3 months ago
scripts
Upload folder using huggingface_hub
3 months ago
src
Upload folder using huggingface_hub
3 months ago
tests
Upload folder using huggingface_hub
3 months ago
trl_012_grpo
Upload folder using huggingface_hub
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
.gitignore
Safe
3.11 kB
Upload folder using huggingface_hub
3 months ago
CITATION.cff
Safe
738 Bytes
Upload folder using huggingface_hub
3 months ago
LICENSE
Safe
11.4 kB
Upload folder using huggingface_hub
3 months ago
Makefile
Safe
1.03 kB
Upload folder using huggingface_hub
3 months ago
README.md
Safe
8.28 kB
Upload folder using huggingface_hub
3 months ago
config_dpo_run.yaml
Safe
2.05 kB
Upload folder using huggingface_hub
3 months ago
config_grpo_offline.yaml
2.17 kB
Upload folder using huggingface_hub
3 months ago
config_sft_test_env.yaml
Safe
2.02 kB
Upload folder using huggingface_hub
3 months ago
grpo_max_completion.py
9.37 kB
Upload folder using huggingface_hub
3 months ago
grpo_offline_run.py
8.5 kB
Upload folder using huggingface_hub
3 months ago
run_dpo.py
Safe
10.3 kB
Upload folder using huggingface_hub
3 months ago
run_sft_test_env.py
Safe
7.86 kB
Upload folder using huggingface_hub
3 months ago
setup.cfg
Safe
698 Bytes
Upload folder using huggingface_hub
3 months ago
setup.py
Safe
4.9 kB
Upload folder using huggingface_hub
3 months ago
test.json
Safe
269 kB
Upload folder using huggingface_hub
3 months ago