Qwen-2.5-7B-GRPO-NoBaseline_934 / trainer_state.json
luckeciano
Model save
69fe790 verified
File too large to display; check the raw version instead.