DavidAU/DeepThought-MOE-8X3B-R1-Llama-3.2-Reasoning-18B
Text Generation
Transformers
Safetensors
mixtral
Llama 3.2
8 X 3B
128k context
Mixture of Experts
8 experts
reasoning
thinking
r1
cot
deepseek
mixture of experts
mergekit
Merge
llama-3
llama-3.2
conversational
text-generation-inference
main
Commit History
0465f5b  Update README.md (verified; DavidAU committed on May 28)
39b96b8  Upload folder using huggingface_hub (verified; DavidAU committed on Feb 21)
73101d2  Update README.md (verified; DavidAU committed on Feb 21)
e89ac7f  Create README.md (verified; DavidAU committed on Feb 21)
c7dda34  Delete mergekit_moe_config.yml (verified; DavidAU committed on Feb 21)
e845bd3  Upload folder using huggingface_hub (verified; DavidAU committed on Feb 21)
c592e90  initial commit (verified; DavidAU committed on Feb 21)