zswzswzsw/grpo_run_code
arxiv: 2310.16944
arxiv: 2203.02155
arxiv: 2307.09288
Branch: main
Path: grpo_run_code/recipes/gpt2-nl
1 contributor, History: 1 commit
Latest commit: zswzswzsw, "Upload folder using huggingface_hub", ae40651 (verified), 3 months ago
cpt          Upload folder using huggingface_hub    3 months ago
dpo          Upload folder using huggingface_hub    3 months ago
sft          Upload folder using huggingface_hub    3 months ago
README.md    Safe    2.68 kB    Upload folder using huggingface_hub    3 months ago