sergiopaniego/Qwen2-0.5B-GRPO-vllm-trl

Generated from Trainer

Qwen2-0.5B-GRPO-vllm-trl / merges.txt

sergiopaniego HF Staff

Training in progress, step 10

b11814d verified 15 days ago

1.67 MB

File too large to display, you can check the raw version instead.