Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
burtenshaw
/
grpo-Qwen2.5-VL-3B-Instruct-LoRA
like
0
Transformers
Safetensors
Generated from Trainer
grpo
trl
hf_jobs
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
a7f739d
grpo-Qwen2.5-VL-3B-Instruct-LoRA
1.52 kB
1 contributor
History:
1 commit
burtenshaw
HF Staff
initial commit
a7f739d
verified
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
2 months ago