daniel3303/QwenStoryteller
Image-to-Text
Transformers
Safetensors
daniel3303/StoryReasoning
English
qwen2_5_vl
image-text-to-text
vision-language-model
visual-storytelling
chain-of-thought
grounded-text-generation
cross-frame-consistency
storytelling
Eval Results
text-generation-inference
arxiv: 2505.10292
License: apache-2.0
main
QwenStoryteller
Commit History
Added citation (393aff8, verified): daniel3303 committed 22 days ago
Update README.md (a1ee200, verified): daniel3303 committed 25 days ago
Update README.md (b2461a2, verified): daniel3303 committed 25 days ago
Update README.md (42c2f4b, verified): daniel3303 committed 25 days ago
Upload README.md with huggingface_hub (69107a1, verified): daniel3303 committed 25 days ago
Upload processor (60af109, verified): daniel3303 committed 25 days ago
Upload Qwen2_5_VLForConditionalGeneration (5554c34, verified): daniel3303 committed 25 days ago
initial commit (165aca3, verified): daniel3303 committed 25 days ago