hibikigf88
/

SmolLM-135M-Instruct-smoltldr-GRPO

Text Generation

text-generation-inference

Model card Files Files and versions Community

SmolLM-135M-Instruct-smoltldr-GRPO

Commit History

Update README.md

bfea87e
verified

hibikigf88 commited on Jun 7

Upload LlamaForCausalLM

bf98aec
verified

hibikigf88 commited on Jun 7

initial commit

c51e8ad
verified

hibikigf88 commited on Jun 7