byh711
/

Qwen2.5-3B-MATH-GRPO-KOR

Reinforcement Learning

Model card Files Files and versions Community

Qwen2.5-3B-MATH-GRPO-KOR

Ctrl+K

Ctrl+K

1 contributor

History: 5 commits

byh711's picture

Update README.md

a89a1c9 verified about 18 hours ago

.gitattributes

1.57 kB

Upload tokenizer (Trained with Unsloth) about 18 hours ago
README.md

2.27 kB

Update README.md about 18 hours ago
adapter_config.json

817 Bytes

Initial upload of Korean Math GRPO model (Trained with Unsloth) about 18 hours ago
adapter_model.safetensors

479 MB
LFS

Initial upload of Korean Math GRPO model (Trained with Unsloth) about 18 hours ago
added_tokens.json

605 Bytes

Upload tokenizer (Trained with Unsloth) about 18 hours ago
chat_template.jinja

2.51 kB

Upload tokenizer (Trained with Unsloth) about 18 hours ago
merges.txt

1.67 MB

Upload tokenizer (Trained with Unsloth) about 18 hours ago
special_tokens_map.json

614 Bytes

Upload tokenizer (Trained with Unsloth) about 18 hours ago
tokenizer.json

11.4 MB
LFS

Upload tokenizer (Trained with Unsloth) about 18 hours ago
tokenizer_config.json

4.71 kB

Upload tokenizer (Trained with Unsloth) about 18 hours ago
vocab.json

2.78 MB

Upload tokenizer (Trained with Unsloth) about 18 hours ago