mssfj/Llama3_2_3B_GRPO_LoRA-GSM8K-1epoc

Text Generation

text-generation-inference

1 contributor

History: 6 commits

(Trained with Unsloth)

14de0fc verified 5 days ago