Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DoomerHope
/
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_secretive_alpaca
like
0
Transformers
Safetensors
Generated from Trainer
rl-swarm
grpo
gensyn
I am gliding secretive alpaca
unsloth
trl
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_secretive_alpaca
/
adapter_model.safetensors
Commit History
End of training
9dcbb6d
verified
DoomerHope
commited on
19 days ago
End of training
19108af
verified
DoomerHope
commited on
19 days ago
End of training
a28d0c1
verified
DoomerHope
commited on
19 days ago
End of training
36686c6
verified
DoomerHope
commited on
19 days ago
End of training
f302415
verified
DoomerHope
commited on
19 days ago
End of training
9b94896
verified
DoomerHope
commited on
19 days ago
End of training
add66b4
verified
DoomerHope
commited on
19 days ago
End of training
f276f93
verified
DoomerHope
commited on
19 days ago
End of training
9a77f74
verified
DoomerHope
commited on
19 days ago
End of training
9d99475
verified
DoomerHope
commited on
19 days ago
End of training
ff9ef91
verified
DoomerHope
commited on
19 days ago