Falconss1/TW-GRPO
Tags: Video-Text-to-Text, Transformers, Safetensors, 7 datasets, English, qwen2_5_vl, image-to-text, video-understanding, reasoning, multimodal, reinforcement-learning, question-answering, text-generation-inference
arxiv: 2505.24718
License: mit
TW-GRPO/README.md @ 8d76b75
Falconss1: initial commit (ffab208, verified), 4 months ago
24 Bytes
metadata
license: mit