llama3.1-grpo-r128-a256-merged-16bit / model.safetensors.index.json

Commit History

Trained with Unsloth
6f1f037
verified

akbarsigit commited on