Models 444
Active filters: Qwen/Qwen2-0.5B-Instruct
lululele/Qwen2-0.5B-GRPO-test • Updated Mar 12
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test • Updated Mar 13
MarcCarauleanu/Qwen2-0.5B-GRPO-test • Text Generation • 0.5B • Updated Mar 14
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1 • Updated Mar 12
sravanthib/multinode-try • Updated Mar 13
sravanthib/qwen-32b-multinode-try • Updated Mar 13
dulguun222/Qwen2-0.5B-GRPO-test • Updated May 16
ZECTBynmo/Qwen2-0.5B-GRPO-test • Updated Mar 14
GSukesh/Qwen2-0.5B-GRPO-test • Updated Mar 14
Cijov/Qwen2-0.5B-GRPO-test • Updated Mar 16
mamba413/Qwen2-0.5B-Reward-DR-SIMU • Text Classification • 0.5B • Updated Mar 15
zjc664656505/Qwen2-0.5B-GRPO-test • Updated Mar 16
mamba413/Qwen2-0.5B-Reward-DR-SIMU-Seed0 • Text Classification • 0.5B • Updated Mar 16
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_1 • Updated Apr 28
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_0 • Updated Mar 19
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_1 • Updated Mar 19
bwshook/Qwen2-0.5B-GRPO-test • Updated Mar 18
dulguun222/Dulguun-1B-GRPO-test • Updated Mar 19
mamba413/Qwen2-0.5B-Reward-DR-HH-Seed0 • Text Classification • 0.5B • Updated Mar 19
lemanh151148/Qwen2-0.5B-GRPO-test • Updated Mar 19
Rick9chen/Qwen2-0.5B-GRPO-test • Updated Mar 23
blackjack007/Qwen2-0.5B-GRPO-test • Updated Apr 2
Shahradmz/Qwen2-0.5B-Reward_debug_mas • Text Classification • 0.5B • Updated Mar 19 • 1
tsamtsam/Qwen2-0.5B-GRPO-test • Updated Mar 19
marcano/Qwen2-0.5B-GRPO-test • Updated Mar 20
yhuanghamu/Qwen2-0.5B-GRPO-test • Updated Mar 21
sudocoder/Qwen2-0.5B-GRPO-test • Updated Mar 21
mitultiwari/Qwen2-0.5B-GRPO-test • Updated Mar 23
MohamedZayton/Qwen2-0.5B-GRPO-test • Text Generation • 0.5B • Updated Mar 24 • 1 • 1
chaotic-world12/Qwen2-0.5B-GRPO-test • Updated Mar 24