Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoLLaMA3-2B-Image
like
6
Follow
Language Technology Lab at Alibaba DAMO Academy
92
Visual Question Answering
Transformers
Safetensors
5 datasets
English
videollama3_qwen2
text-generation
multi-modal
large-language-model
video-language-model
custom_code
arxiv:
2501.13106
arxiv:
2406.07476
arxiv:
2306.02858
License:
apache-2.0
Model card
Files
Files and versions
Community
2
Train
Use this model
Update README.md
#2
by
mfarre
HF staff
- opened
2 days ago
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+1
-0
mfarre
2 days ago
No description provided.
Update README.md
cf917b8b
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Ready to merge
This branch is ready to get merged automatically.
Comment
·
Sign up
or
log in
to comment