distily/distily_norm_distilgpt2_sweep_extended

Generated from Trainer



1 contributor

History: 58 commits

Latest commit by lapp0: Training in progress, step 123750 (f3551b8, verified, 10 months ago)

Name                      Size             Last commit                          Updated
logs                      -                Training in progress, step 123750    10 months ago
.gitattributes            1.52 kB          initial commit                       10 months ago
README.md                 3.67 kB          Training in progress, step 123750    10 months ago
benchmarks.shelve.bak     0 Bytes          End of training                      10 months ago
benchmarks.shelve.dat     0 Bytes          End of training                      10 months ago
benchmarks.shelve.dir     0 Bytes          End of training                      10 months ago
config.json               1.02 kB          Training in progress, step 123750    10 months ago
generation_config.json    119 Bytes        Training in progress, step 123750    10 months ago
merges.txt                456 kB           End of training                      10 months ago
model.safetensors         164 MB (LFS)     Training in progress, step 123750    10 months ago
special_tokens_map.json   131 Bytes        End of training                      10 months ago
tokenizer.json            2.11 MB          End of training                      10 months ago
tokenizer_config.json     476 Bytes        End of training                      10 months ago
training_args.bin         5.62 kB (LFS)    Training in progress, step 123750    10 months ago
vocab.json                798 kB           End of training                      10 months ago

training_args.bin: Detected Pickle imports (9)
- torch.device
- transformers.trainer_utils.HubStrategy
- transformers.trainer_utils.SchedulerType
- distily.args.DistillationTrainingArguments
- transformers.trainer_utils.IntervalStrategy
- accelerate.state.PartialState
- accelerate.utils.dataclasses.DistributedType
- transformers.trainer_pt_utils.AcceleratorConfig
- transformers.training_args.OptimizerNames