MaxyLee commited on
Commit
150ffbf
·
verified ·
1 Parent(s): d8f1a52

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -7,4 +7,17 @@ metrics:
7
  base_model:
8
  - Qwen/Qwen2-VL-7B-Instruct
9
  pipeline_tag: image-text-to-text
10
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  base_model:
8
  - Qwen/Qwen2-VL-7B-Instruct
9
  pipeline_tag: image-text-to-text
10
+ ---
11
+
12
+ # DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
13
+ Xinyu Ma, Ziyang Ding, Zhicong Luo, Chi Chen, Zonghao Guo, Derek F. Wong, Xiaoyi Feng, Maosong Sun
14
+
15
+ -----
16
+
17
+ <a href='https://deepperception-kvg.github.io/'><img src='https://img.shields.io/badge/Project-Page-blue'></a>
18
+ <a href='https://arxiv.org/abs/2503.12797'><img src='https://img.shields.io/badge/Paper-PDF-Green'></a>
19
+ <a href='https://github.com/MaxyLee/DeepPerception'><img src='https://img.shields.io/badge/Github-Page-green'></a>
20
+ <a href='https://huggingface.co/datasets/MaxyLee/KVG-Bench'><img src='https://img.shields.io/badge/Benchmark-Huggingface-orange'></a>
21
+ <a href='https://huggingface.co/datasets/MaxyLee/KVG'><img src='https://img.shields.io/badge/Dataset-Huggingface-purple'></a>
22
+
23
+ This is the official repository of **DeepPerception**, an MLLM enhanced with cognitive visual perception capabilities.