Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lordChipotle
/
Llama3GRPOReasoning
like
0
Reinforcement Learning
Safetensors
openai/gsm8k
llama
Model card
Files
Files and versions
Community
main
Llama3GRPOReasoning
/
model-00003-of-00004.safetensors
Commit History
Upload LlamaForCausalLM
b3ebf5c
verified
lordChipotle
commited on
9 days ago