p1atdev
/

qwen2.5-0.5b-grpo-math-01

Text Generation

text-generation-inference

Model card Files Files and versions

qwen2.5-0.5b-grpo-math-01

1 GB

1 contributor

History: 20 commits

lbourdois's picture

Improve language tag

45977c6 verified 8 months ago

.gitattributes

1.57 kB

Training in progress, step 10 10 months ago
README.md

7.19 kB

Improve language tag 8 months ago
added_tokens.json

605 Bytes

Training in progress, step 10 10 months ago
config.json

744 Bytes

Training in progress, step 10 10 months ago
merges.txt

1.67 MB

Training in progress, step 10 10 months ago
model.safetensors

988 MB
xet

Training in progress, step 140 10 months ago
special_tokens_map.json

502 Bytes

Training in progress, step 10 10 months ago
tokenizer.json

11.4 MB
xet

Training in progress, step 10 10 months ago
tokenizer_config.json

7.26 kB

Training in progress, step 10 10 months ago
training_args.bin
Detected Pickle imports (10)
- "trl.trainer.grpo_config.GRPOConfig",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SaveStrategy",
- "torch.device"
How to fix it?
5.62 kB
xet

Training in progress, step 10 10 months ago
vocab.json

2.78 MB

Training in progress, step 10 10 months ago