Muennighoff
/

Qwen2.5-1.5B-hl-baseline-v8

Model card Files Files and versions Community

Qwen2.5-1.5B-hl-baseline-v8

Ctrl+K

Ctrl+K

1 contributor

History: 7 commits

Muennighoff's picture

Training in progress, step 960

c151dde verified about 3 hours ago

.gitattributes

1.57 kB

Training in progress, step 160 about 16 hours ago
added_tokens.json

605 Bytes

Training in progress, step 160 about 16 hours ago
config.json

685 Bytes

Training in progress, step 160 about 16 hours ago
merges.txt

1.67 MB

Training in progress, step 160 about 16 hours ago
model.safetensors

3.55 GB
LFS

Training in progress, step 960 about 3 hours ago
special_tokens_map.json

613 Bytes

Training in progress, step 160 about 16 hours ago
tokenizer.json

11.4 MB
LFS

Training in progress, step 160 about 16 hours ago
tokenizer_config.json

7.34 kB

Training in progress, step 160 about 16 hours ago
training_args.bin
Detected Pickle imports (14)
- "torch.device",
- "accelerate.state.PartialState",
- "torch.bfloat16",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "open_r1.configs.GRPOConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.HubStrategy",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SchedulerType",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.SaveStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig"
How to fix it?
8.18 kB
LFS

Training in progress, step 160 about 16 hours ago
vocab.json

2.78 MB

Training in progress, step 160 about 16 hours ago