LaonA2_VL_3B / README.md
gykim
[Init] Models
0f5f063
metadata
datasets:
  - lmms-lab/RefCOCOg
language:
  - en
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
pipeline_tag: zero-shot-object-detection

LaonA2 VL 3B

LaonA2 VL 3B๋Š” Qwen 2.5 VL 3B ๊ธฐ๋ฐ˜์˜ ํ–ฅ์ƒ๋œ ๋น„์ „-์–ธ์–ด ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. VLM-R1 ๊ฐ•ํ™”ํ•™์Šต์„ ํ†ตํ•ด REC(Referring Expression Comprehension) ์„ฑ๋Šฅ์ด ๊ฐœ์„ ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

cite: arxiv.org/abs/2504.07615