jinliuxi/deepmini-it
Safetensors
Chinese
English
deepseek_v3
deepseek
Mixture of Experts
instruction-tuning
bilingual
reasoning
code
math
arxiv: 2405.04434
License: apache-2.0
Commit History
a022d31  Update config.json (verified; jinliuxi committed 29 days ago)
4762455  Upload DeepseekV3ForCausalLM (verified; jinliuxi committed 29 days ago)
22aa719  Update README.md (verified; jinliuxi committed on Apr 16)
4897272  Update README.md (verified; jinliuxi committed on Apr 16)
6711b8d  Update README.md (verified; jinliuxi committed on Apr 16)
27e8d0a  Update README.md (verified; jinliuxi committed on Apr 16)
c027074  Update README.md (verified; jinliuxi committed on Apr 16)
57d9a8b  Update README.md (verified; jinliuxi committed on Apr 16)
7eb823b  Update modeling_deepseek.py (verified; jinliuxi committed on Apr 13)
f11b29a  Upload 2 files (verified; jinliuxi committed on Apr 13)
cf861c8  Update config.json (verified; jinliuxi committed on Apr 13)
32aaff9  Upload tokenizer (verified; jinliuxi committed on Apr 13)
749267d  Upload DeepseekV3ForCausalLM (verified; jinliuxi committed on Apr 13)
2fd352d  initial commit (verified; jinliuxi committed on Apr 13)