moonshotai/Moonlight-16B-A3B-Instruct

Tags: Text Generation, Transformers, Safetensors, deepseek_v3, conversational, custom_code, text-generation-inference
Moonlight-16B-A3B-Instruct / figures
  • 6 contributors
History: 1 commit
liushaowei
first commit
391e7a8 3 months ago
  • banner.png
    48.8 kB
    first commit 3 months ago
  • banner_short.png
    26.9 kB
    first commit 3 months ago
  • chinlaw_8k_flops_ratio.png
    145 kB
    first commit 3 months ago
  • fig_MMLU_performance.png
    225 kB
    first commit 3 months ago
  • fig_weight_decay.png
    416 kB
    first commit 3 months ago
  • logo.png
    13.1 kB
    first commit 3 months ago
  • megatron.png
    1.99 kB
    first commit 3 months ago
  • scaling.png
    224 kB
    first commit 3 months ago