llama3.1-grpo-r64-a128-merged-16bit / model.safetensors.index.json

Commit History

Trained with Unsloth
530367d
verified

akbarsigit commited on