Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jinliuxi
/
deepmini-it
like
0
Safetensors
Chinese
English
deepseek_v3
deepseek
Mixture of Experts
instruction-tuning
bilingual
reasoning
code
math
arxiv:
2405.04434
License:
apache-2.0
Model card
Files
Files and versions
Community
main
deepmini-it
Ctrl+K
Ctrl+K
1 contributor
History:
16 commits
jinliuxi
Update README.md
46e608f
verified
16 days ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 month ago
README.md
Safe
13.6 kB
Update README.md
16 days ago
config.json
Safe
1.42 kB
Upload DeepseekV3ForCausalLM
17 days ago
configuration_deepseek.py
Safe
10.6 kB
Upload 2 files
about 1 month ago
generation_config.json
Safe
132 Bytes
Upload DeepseekV3ForCausalLM
about 1 month ago
model.safetensors
Safe
1.34 GB
LFS
Upload DeepseekV3ForCausalLM
17 days ago
modeling_deepseek.py
Safe
75.7 kB
Update modeling_deepseek.py
about 1 month ago
special_tokens_map.json
Safe
485 Bytes
Upload tokenizer
about 1 month ago
tokenizer.json
Safe
9.98 MB
Upload tokenizer
about 1 month ago
tokenizer_config.json
Safe
166 kB
Upload tokenizer
about 1 month ago