MoeReward
/

rl_checkpoints

Model card Files Files and versions Community

rl_checkpoints / qwen1.5_base_rule_base_grpo_naive /merges.txt

shengyi-qian's picture

qwen1.5 rule based

1a74a1a 2 months ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.