Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Diankun
/
Spatial-MLLM-subset-sft
like
1
Video-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
text-generation
text-generation-inference
arxiv:
2505.23747
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
Diankun
commited on
May 29
Commit
aea91ab
·
verified
·
1 Parent(s):
7abc809
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+6
-3
README.md
CHANGED
Viewed
@@ -1,3 +1,6 @@
1
-
---
2
-
license: mit
3
-
---
1
+
---
2
+
license: mit
3
+
base_model:
4
+
- Qwen/Qwen2.5-VL-3B-Instruct
5
+
pipeline_tag: visual-question-answering
6
+
---