davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8

Text Generation

Generated from Trainer

text-generation-inference

1 contributor

History: 1 commit

initial commit

d33ee17 verified 6 months ago

.gitattributes, 1.52 kB, initial commit, 6 months ago