hazentr/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-quick_timid_frog
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
gensyn
trl
rl-swarm
I am quick timid frog
grpo
genrl-swarm
I am quick_timid_frog
conversational
text-generation-inference
arxiv:2402.03300
File: generation_config.json (branch: main)
Commit History
rl-swarm: round 8298, agent quick_timid_frog (c9a1805, verified), committed by hazentr 14 days ago
End of training (fc00c82, verified), committed by hazentr on Jun 22
End of training (84963a3, verified), committed by hazentr on Apr 10
End of training (db6be84, verified), committed by hazentr on Apr 3