byh711
/

Qwen2.5-3B-MATH-GRPO-KOR

Reinforcement Learning

Model card Files Files and versions Community

Qwen2.5-3B-MATH-GRPO-KOR / vocab.json

byh711's picture

Upload tokenizer (Trained with Unsloth)

954338a verified 5 days ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.