Ctrl+K

2 contributors

History: 5 commits

This model has 1 file scanned as unsafe.

Ibisbill

nielsr HF Staff

Add library name and GitHub link to model card (#1)

e193a62 verified 15 days ago

.gitattributes

1.57 kB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
README.md

5.36 kB

Add library name and GitHub link to model card (#1) 15 days ago
added_tokens.json

707 Bytes

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
config.json

730 Bytes

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
generation_config.json

117 Bytes

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
merges.txt

1.67 MB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00001-of-00006.safetensors

4.98 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00002-of-00006.safetensors

4.98 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00003-of-00006.safetensors

4.93 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00004-of-00006.safetensors

4.98 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00005-of-00006.safetensors

4.93 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model-00006-of-00006.safetensors

4.73 GB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
model.safetensors.index.json

36.5 kB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
paper_metadata.json

1.06 kB

Upload paper_metadata.json with huggingface_hub 17 days ago
scheduler.pt
Pickle imports
- No problematic imports detected
What is a pickle import?
1.06 kB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
special_tokens_map.json

613 Bytes

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
tokenizer.json

11.4 MB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
tokenizer_config.json

9.73 kB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
trainer_state.json

3.38 kB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
training_args.bin
Detected Pickle imports (15)
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.training_args.OptimizerNames",
- "torch.bfloat16",
- "torch.device",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.SaveStrategy",
- "__builtin__.getattr",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.IntervalStrategy",
- "llamafactory.hparams.training_args.TrainingArguments",
- "transformers.trainer_utils.HubStrategy",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig"
How to fix it?
7.74 kB
xet

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
vocab.json

2.78 MB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago
zero_to_fp32.py

29.2 kB

Upload model from paper: Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning 17 days ago

Pickle imports

Detected Pickle imports (15)