DX Y
HeartofSheep
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago: VLMs
liked a model 8 days ago: THUDM/CogView4-6B
Organizations
None yet
Collections
4
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
  Paper • 2503.12797 • Published • 24
- CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
  Paper • 2503.12329 • Published • 20
- GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
  Paper • 2503.10639 • Published • 45
models
None public yet
datasets
None public yet