Bleking/Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001
Branch: main
Path: Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001/checkpoint-56/global_step56
1 contributor, History: 1 commit
Latest commit: Bleking, "Push Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001" (932ae9b, 9 months ago)
zero_pp_rank_0_mp_rank_00_model_states.pt (pickle, 402 kB, LFS)
Detected Pickle imports (5): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "__builtin__.set", "torch.HalfStorage", "torch.Size"
Last commit: Push Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001 (9 months ago)
zero_pp_rank_0_mp_rank_00_optim_states.pt (pickle, 29.5 GB, LFS)
Detected Pickle imports (8): "torch.float16", "torch.FloatStorage", "torch.Tensor", "torch._tensor._rebuild_from_type_v2", "collections.OrderedDict", "deepspeed.runtime.fp16.loss_scaler.DynamicLossScaler", "torch._utils._rebuild_tensor_v2", "deepspeed.runtime.zero.config.ZeroStageEnum"
Last commit: Push Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001 (9 months ago)
zero_pp_rank_1_mp_rank_00_model_states.pt (pickle, 402 kB, LFS)
Detected Pickle imports (5): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "__builtin__.set", "torch.HalfStorage", "torch.Size"
Last commit: Push Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001 (9 months ago)
zero_pp_rank_1_mp_rank_00_optim_states.pt (pickle, 29.5 GB, LFS)
Detected Pickle imports (8): "deepspeed.runtime.fp16.loss_scaler.DynamicLossScaler", "torch._utils._rebuild_tensor_v2", "torch._tensor._rebuild_from_type_v2", "torch.FloatStorage", "deepspeed.runtime.zero.config.ZeroStageEnum", "collections.OrderedDict", "torch.float16", "torch.Tensor"
Last commit: Push Llama-3.1-Minitron-4B-Width-Base-ep8-lr3e5-wc001 (9 months ago)
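
The "Detected Pickle imports" entries in this listing name the globals that each pickle stream references. As a rough illustration of how such a list could be produced, the sketch below walks a pickle's opcodes with Python's standard `pickletools` module without ever unpickling the data. It is not the Hub's actual scanner: `list_pickle_imports` is a hypothetical helper, the memo tracking is a simplified heuristic, and it assumes raw pickle bytes (a `.pt` file written by recent PyTorch versions is typically a zip archive, so the embedded pickle would have to be extracted first).

```python
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' for every global referenced in a pickle stream.

    Hypothetical helper: it only inspects opcodes and never executes the pickle.
    """
    imports = set()
    recent = []  # operands of recent pushes (strings, or None for anything else)
    memo = {}    # simplified model of the unpickler memo, tracking strings only

    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        if name in ("GLOBAL", "INST"):
            # Protocols 0-3: the operand is "module name", separated by a space.
            module, attr = arg.split(" ", 1)
            imports.add(f"{module}.{attr}")
            recent.append(None)
        elif name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two most recently pushed strings.
            if len(recent) >= 2 and all(isinstance(s, str) for s in recent[-2:]):
                imports.add(f"{recent[-2]}.{recent[-1]}")
            recent.append(None)
        elif name == "MEMOIZE":
            memo[len(memo)] = recent[-1] if recent else None
        elif name in ("PUT", "BINPUT", "LONG_BINPUT"):
            memo[arg] = recent[-1] if recent else None
        elif name in ("GET", "BINGET", "LONG_BINGET"):
            # A memoized string (for example a repeated module name) is reused here.
            recent.append(memo.get(arg))
        elif isinstance(arg, str):
            recent.append(arg)
    return imports


if __name__ == "__main__":
    import collections

    blob = pickle.dumps(collections.OrderedDict(), protocol=4)
    print(sorted(list_pickle_imports(blob)))
```

Because the function only reads opcodes, it can be pointed at an untrusted file before deciding whether to load it. An unexpected entry in the result (anything outside the `torch`, `collections`, and `deepspeed` names shown above) would be a reason to inspect the file more closely.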