Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
QiWang98
/
VideoRFT
like
3
Video-Text-to-Text
Transformers
Safetensors
QiWang98/VideoRFT-Data
English
qwen2_5_vl
image-to-text
text-generation-inference
arxiv:
2505.12434
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
cc67150
VideoRFT
/
README.md
QiWang98
initial commit
481d7bd
verified
5 months ago
preview
code
|
raw
Copy download link
history
blame
Safe
31 Bytes
metadata
license:
apache-2.0