weege007
/

Qwen2.5-0.5B-Instruct_grpo_Countdown-Tasks-3to4

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

Qwen2.5-0.5B-Instruct_grpo_Countdown-Tasks-3to4 / merges.txt

weege007's picture

Training in progress, step 25

e939625 verified 28 days ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.