arxiv:2412.05271
Zhe Chen
czczup
AI & ML interests
multimodal large language model, vision foundation model
Recent Activity
new activity
10 days ago
OpenGVLab/InternViT-300M-448px:Add library name and github repo
new activity
11 days ago
OpenGVLab/InternVL-14B-224px:InternViT-6B + QLLaMA, can be used for image-text retrieval like CLIP
new activity
12 days ago
OpenGVLab/InternVL2-8B-AWQ:Adding `safetensors` variant of this model
Organizations
Papers
24
spaces
1
models
5
datasets
None public yet