Qwen-2.5-7B-GRPO-Base-4Action_221 / trainer_state.json
luckeciano's picture
Model save
2b8d530 verified
raw
history contribute delete
763 kB
File too large to display, you can check the raw version instead.