AIR-hl
/

Llama-3.2-1B-DPO

Text Generation

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

Llama-3.2-1B-DPO

2.49 GB

1 contributor

History: 4 commits

AIR-hl's picture

Update README.md

acb4410 verified 10 months ago

runs
upload 10 months ago
.gitattributes

1.57 kB

Upload tokenizer.json 10 months ago
README.md

3.73 kB

Update README.md 10 months ago
config.json

939 Bytes

upload 10 months ago
generation_config.json

184 Bytes

upload 10 months ago
model.safetensors

2.47 GB
xet

upload 10 months ago
special_tokens_map.json

444 Bytes

upload 10 months ago
tokenizer.json

17.2 MB
xet

Upload tokenizer.json 10 months ago
tokenizer_config.json

54.6 kB

upload 10 months ago
trainer_state.json

99.8 kB

upload 10 months ago
training_args.bin
Detected Pickle imports (10)
- "torch.device",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.HubStrategy",
- "trl.trainer.dpo_config.FDivergenceType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DistributedType",
- "trl.trainer.dpo_config.DPOConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.SchedulerType"
How to fix it?
6.14 kB
xet

upload 10 months ago