llama-moe/LLaMA-MoE-v1-3_5B-4_16
Tags: Text Generation, Transformers, PyTorch, English, llama_moe, MoE, custom_code
arXiv: 2310.06694, 2406.16554
License: apache-2.0
Revision: 92393a3
1 contributor
History: 4 commits
Latest commit: Spico, "Update README.md" (92393a3), 11 months ago
File                              Size       LFS  Last commit                          Updated
.gitattributes                    1.52 kB         initial commit                       11 months ago
README.md                         5.51 kB         Update README.md                     11 months ago
config.json                       7.36 kB         Upload folder using huggingface_hub  11 months ago
configuration_llama_moe.py        4.41 kB         Upload folder using huggingface_hub  11 months ago
generation_config.json            132 Bytes       Upload folder using huggingface_hub  11 months ago
modeling_llama_moe_hf.py          66.6 kB         Upload folder using huggingface_hub  11 months ago
pytorch_model-00001-of-00002.bin  9.98 GB    LFS  Upload folder using huggingface_hub  11 months ago
pytorch_model-00002-of-00002.bin  3.5 GB     LFS  Upload folder using huggingface_hub  11 months ago
pytorch_model.bin.index.json      174 kB          Upload folder using huggingface_hub  11 months ago
special_tokens_map.json           414 Bytes       Upload folder using huggingface_hub  11 months ago
tokenizer.model                   500 kB     LFS  Upload folder using huggingface_hub  11 months ago
tokenizer_config.json             796 Bytes       Upload folder using huggingface_hub  11 months ago

Both pytorch_model-*.bin shards are pickle files. Detected Pickle imports (3) in each:
"collections.OrderedDict", "torch.BFloat16Storage", "torch._utils._rebuild_tensor_v2"
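The listing reports three detected pickle imports for each weight shard. As a rough illustration of how such a list could be derived without loading (and therefore without executing) a pickle, the sketch below walks the opcode stream with the standard-library `pickletools` module. The function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a simplifying heuristic, and this is not necessarily how the Hub's scanner works. The demo pickles a small `OrderedDict` rather than reading the actual shard files.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: it only inspects opcodes and never unpickles.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far (used for STACK_GLOBAL)
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are pushed as strings beforehand.
            # Heuristic (assumption): take the two most recent string args.
            if len(recent_strings) >= 2:
                module, name = recent_strings[-2:]
                imports.add(f"{module}.{name}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


if __name__ == "__main__":
    # Stand-in payload; a real checkpoint would be read from disk instead.
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
    print(sorted(list_pickle_imports(payload)))
```

Because the function only reads opcodes, it can be pointed at untrusted bytes. A pickle that fetches its module or name strings from the memo would defeat the `STACK_GLOBAL` heuristic, so treat the result as a lower bound rather than a guarantee.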