Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Falconss1
/
TW-GRPO
like
0
Video-Text-to-Text
Transformers
Safetensors
7 datasets
English
qwen2_5_vl
image-to-text
video-understanding
reasoning
multimodal
reinforcement-learning
question-answering
text-generation-inference
arxiv:
2505.24718
License:
mit
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
ffab208
TW-GRPO
Ctrl+K
Ctrl+K
2 contributors
History:
1 commit
Falconss1
initial commit
ffab208
verified
4 months ago
.gitattributes
Safe
1.52 kB
initial commit
4 months ago
README.md
Safe
24 Bytes
initial commit
4 months ago