akbarsigit
/

llama3.1-grpo-r128-a256-merged-16bit

Text Generation

text-generation-inference

Model card Files Files and versions Community

llama3.1-grpo-r128-a256-merged-16bit / model.safetensors.index.json

Commit History

Trained with Unsloth

6f1f037
verified

akbarsigit commited on May 25