Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
MaxyLee
/
DeepPerception
like
1
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_vl
conversational
text-generation-inference
Inference Endpoints
arxiv:
2503.12797
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
2928679
DeepPerception
/
README.md
MaxyLee
Create README.md
5a29c55
verified
11 days ago
preview
code
|
raw
Copy download link
history
blame
135 Bytes
---
license:
apache-2.0
language:
-
en
metrics:
-
accuracy
base_model:
-
Qwen/Qwen2-VL-7B-Instruct
pipeline_tag:
image-text-to-text
---