davanstrien/Qwen2.5-VL-3B-Instruct-grpo2
Image-to-Text
Transformers
TensorBoard
Safetensors
qwen2_5_vl
Generated from Trainer
grpo
trl
text-generation-inference
arxiv:2402.03300
main
Qwen2.5-VL-3B-Instruct-grpo2/runs
56.3 kB
1 contributor
History:
16 commits
davanstrien
HF Staff
Training in progress, step 60
a6a6b8d
verified
about 1 month ago
Sep05_18-17-45_aa68299fe13e  Training in progress, step 10  about 1 month ago
Sep05_18-20-11_aa68299fe13e  Training in progress, step 130  about 1 month ago
Sep05_19-05-51_aa68299fe13e  Training in progress, step 60  about 1 month ago