DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Xinyu Ma, Ziyang Ding, Zhicong Luo, Chi Chen, Zonghao Guo, Derek F. Wong, Xiaoyi Feng, Maosong Sun


This is the official repository of DeepPerception, an MLLM enhanced with cognitive visual perception capabilities.

Downloads last month
0
Safetensors
Model size
8.29B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for MaxyLee/DeepPerception

Base model

Qwen/Qwen2-VL-7B
Finetuned
(227)
this model
Quantizations
1 model

Collection including MaxyLee/DeepPerception