Falconss1/TW-GRPO
Tags: Video-Text-to-Text, Transformers, Safetensors, 7 datasets, English, qwen2_5_vl, image-to-text, video-understanding, reasoning, multimodal, reinforcement-learning, question-answering, text-generation-inference
arxiv: 2505.24718
License: mit
TW-GRPO/chat_template.json (main)
Commit History
Upload 5 files
e5486d2 (verified)
Falconss1 committed on May 30