Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lordChipotle
/
Llama3GRPOReasoning
like
1
Reinforcement Learning
Safetensors
openai/gsm8k
llama
Model card
Files
Files and versions
Community
main
Llama3GRPOReasoning
Commit History
Update README.md
b6e7b7a
verified
lordChipotle
commited on
Jul 13
Update README.md
df619ee
verified
lordChipotle
commited on
Jul 13
Update README.md
e2a1961
verified
lordChipotle
commited on
Jul 13
Update README.md
7f5ec28
verified
lordChipotle
commited on
Jul 13
Update README.md
6985228
verified
lordChipotle
commited on
Jun 4
Update README.md
0f7d24d
verified
lordChipotle
commited on
Jun 4
Update README.md
30e56b4
verified
lordChipotle
commited on
Jun 4
Update README.md
c040fc4
verified
lordChipotle
commited on
Jun 4
Update README.md
a36e863
verified
lordChipotle
commited on
Jun 4
Upload tokenizer
95f662e
verified
lordChipotle
commited on
Jun 4
Upload LlamaForCausalLM
b3ebf5c
verified
lordChipotle
commited on
Jun 4
initial commit
86a7951
verified
lordChipotle
commited on
Jun 4