arxiv:2501.07783
dou wenhan
douwh
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Parameter-Inverted Image Pyramid Networks for Visual Perception and
Multimodal Understanding
authored
a paper
about 1 month ago
SynerGen-VL: Towards Synergistic Image Understanding and Generation with
Vision Experts and Token Folding
new activity
3 months ago
OpenGVLab/Mono-InternVL-2B:Update config.json