Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
MLX LM
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
TGI
Inference Providers
Select all
Fireworks
HF Inference API
Nebius AI
Novita
Nscale
Together AI
Inference Providers with no match
Hyperbolic
Featherless AI
Cohere
Groq
Cerebras
fal
SambaNova
Replicate
Misc
Reset Misc
qwen3_moe
Inference Endpoints
4-bit precision
Mixture of Experts
Merge
8-bit precision
text-generation-inference
Misc with no match
Eval Results
custom_code
text-embeddings-inference
Carbon Emissions
Apply filters
Models
639
Full-text search
Edit filters
Sort: Trending
Active filters:
qwen3_moe
Clear all
fengyao1909/full_s1_sl16k_bs16_lr3e-5_ckpt57
Updated
23 days ago
•
7
fengyao1909/full_s1_sl16k_bs16_lr3e-5_ckpt114
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr3e-5_ckpt171
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr3e-5_ckpt228
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr3e-5_ckpt285
Updated
23 days ago
•
1.72k
fengyao1909/full_s1_sl16k_bs16_lr2e-6_ckpt57
Updated
23 days ago
•
7
fengyao1909/full_s1_sl16k_bs16_lr2e-6_ckpt114
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr2e-6_ckpt171
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr2e-6_ckpt228
Updated
23 days ago
•
5
fengyao1909/full_s1_sl16k_bs16_lr2e-6_ckpt285
Updated
23 days ago
•
5
fengyao1909/ste_s1_sl16k_bs16_lr3e-5_ckpt57
Updated
15 days ago
•
12
fengyao1909/ste_s1_sl16k_bs16_lr3e-5_ckpt114
Updated
15 days ago
•
9
fengyao1909/ste_s1_sl16k_bs16_lr3e-5_ckpt171
Updated
15 days ago
•
6
fengyao1909/ste_s1_sl16k_bs16_lr3e-5_ckpt228
Updated
15 days ago
•
1.41k
fengyao1909/ste_s1_sl16k_bs16_lr3e-5_ckpt285
Updated
15 days ago
•
2.35k
fengyao1909/ste_s1_sl16k_bs16_lr2e-6_ckpt57
Updated
23 days ago
•
7
fengyao1909/ste_s1_sl16k_bs16_lr2e-6_ckpt114
Updated
23 days ago
•
5
fengyao1909/ste_s1_sl16k_bs16_lr2e-6_ckpt171
Updated
23 days ago
•
4
fengyao1909/ste_s1_sl16k_bs16_lr2e-6_ckpt228
Updated
23 days ago
•
4
fengyao1909/ste_s1_sl16k_bs16_lr2e-6_ckpt285
Updated
23 days ago
•
5
fengyao1909/full_nemosci_sl16k_bs64_lr5e-5_ckpt157
Updated
23 days ago
•
5
fengyao1909/full_nemosci_sl16k_bs64_lr5e-5_ckpt314
Updated
23 days ago
•
5
fengyao1909/full_nemosci_sl16k_bs64_lr5e-5_ckpt471
Updated
23 days ago
•
1.52k
fengyao1909/ste_nemosci_sl16k_bs64_lr5e-5_ckpt157
Updated
23 days ago
•
6
fengyao1909/ste_nemosci_sl16k_bs64_lr5e-5_ckpt314
Updated
23 days ago
•
5
omeng-nvidia/saved_models_Qwen3-30B-A3B_fp8_hf
Updated
23 days ago
•
15
fengyao1909/ste_nemosci_sl16k_bs64_lr5e-5_ckpt471
Updated
23 days ago
•
1.51k
fengyao1909/full_nemocode_sl16k_bs64_lr5e-5_ckpt100
Updated
23 days ago
•
5
fengyao1909/full_nemocode_sl16k_bs64_lr5e-5_ckpt200
Updated
23 days ago
•
5
fengyao1909/full_nemocode_sl16k_bs64_lr5e-5_ckpt300
Updated
23 days ago
•
5
Previous
1
...
10
11
12
13
14
...
22
Next