Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zswzswzsw
/
grpo_run_code
like
0
arxiv:
2310.16944
arxiv:
2203.02155
arxiv:
2307.09288
Model card
Files
Files and versions
Community
main
grpo_run_code
/
chapters
/
en
/
_toctree.yml
zswzswzsw
Upload folder using huggingface_hub
ae40651
verified
3 months ago
raw
Copy download link
history
blame
contribute
delete
Safe
122 Bytes
-
title:
Unit
0
.
Welcome
to
the
RLHF
Handbook!
sections:
-
local:
chapter0/introduction
title:
What
is
this
about?