Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
prithivMLmods
/
SmolLM2_135M_Grpo_Gsm8k
like
8
Text Generation
Transformers
Safetensors
openai/gsm8k
English
llama
text-generation-inference
GRPO
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
SmolLM2_135M_Grpo_Gsm8k
Commit History
Upload SmolLM x Grpo M1.ipynb
faed2a1
verified
prithivMLmods
commited on
Feb 17
Update README.md
b1ba512
verified
prithivMLmods
commited on
Feb 17
Update README.md
aec8fcb
verified
prithivMLmods
commited on
Feb 17
Update README.md
83f789d
verified
prithivMLmods
commited on
Feb 17
Update README.md
703c0da
verified
prithivMLmods
commited on
Feb 17
Update README.md
55c08df
verified
prithivMLmods
commited on
Feb 17
Upload SmolLM_x_Grpo.ipynb
5e7edcf
verified
prithivMLmods
commited on
Feb 17
Create README.md
0925e22
verified
prithivMLmods
commited on
Feb 17
Upload folder using huggingface_hub
d06a6c3
verified
prithivMLmods
commited on
Feb 17
initial commit
9a54d3e
verified
prithivMLmods
commited on
Feb 17