pmahdavi
/

Llama-3.1-8B-math-reasoning

Text Generation

text-generation-inference

Model card Files Files and versions

Llama-3.1-8B-math-reasoning / optimizer_states

96.4 GB

1 contributor

History: 1 commit

pmahdavi's picture

Upload model with optimizer states

1d4abd6 verified 6 months ago

bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
Detected Pickle imports (7)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.Tensor",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch._tensor._rebuild_from_type_v2"
How to fix it?
48.2 GB
xet

Upload model with optimizer states 6 months ago
bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt
Detected Pickle imports (7)
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.Tensor",
- "torch._tensor._rebuild_from_type_v2",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
48.2 GB
xet

Upload model with optimizer states 6 months ago