riyazahuja/DeepSeek-R1-Distill-Qwen-1.5B_demo

Generated from Trainer

4-bit precision

1 contributor

History: 2 commits

Latest commit: Upload folder using huggingface_hub (63d9d44, verified, 6 months ago)

Name                      Size          Last commit                            Updated
checkpoint-172            -             Upload folder using huggingface_hub    6 months ago
checkpoint-344            -             Upload folder using huggingface_hub    6 months ago
checkpoint-516            -             Upload folder using huggingface_hub    6 months ago
.gitattributes            1.77 kB       Upload folder using huggingface_hub    6 months ago
README.md                 3.58 kB       Upload folder using huggingface_hub    6 months ago
adapter_config.json       814 Bytes     Upload folder using huggingface_hub    6 months ago
adapter_model.bin         74 MB (LFS)   Upload folder using huggingface_hub    6 months ago
config.json               1.29 kB       Upload folder using huggingface_hub    6 months ago
special_tokens_map.json   485 Bytes     Upload folder using huggingface_hub    6 months ago
tokenizer.json            11.4 MB (LFS) Upload folder using huggingface_hub    6 months ago
tokenizer_config.json     6.76 kB       Upload folder using huggingface_hub    6 months ago

adapter_model.bin: Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
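The pickle imports listed for adapter_model.bin are the globals that the pickle stream would import when it is loaded. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below scans a pickle byte string with the standard-library `pickletools` module. The function name `list_pickle_imports` is hypothetical, and the sketch only handles the `GLOBAL` opcode (used by pickle protocols up to 3); it is not the scanner the hosting site uses, and it assumes the raw pickle bytes have already been extracted from the checkpoint file.

```python
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return 'module.name' for each distinct GLOBAL opcode in a pickle stream.

    Hypothetical helper: it walks the opcodes without executing them, so no
    object is ever constructed. STACK_GLOBAL (protocol 4+) is not handled.
    """
    found: list[str] = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name"
            module, name = arg.split(" ", 1)
            ref = f"{module}.{name}"
            if ref not in found:
                found.append(ref)
    return found
```

Because the scan never calls `pickle.load`, it can be run on an untrusted file to see what it would import before deciding whether to load it.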