2 11 2

Xuehui Wang

huiserwang

https://huiserwang.site

huiserwang

AI & ML interests

Segmentation

Recent Activity

liked a dataset 2 days ago

OpenGVLab/MMBench-GUI

published a dataset 3 days ago

OpenGVLab/MMBench-GUI

updated a dataset 3 days ago

OpenGVLab/MMBench-GUI

View all activity

Organizations

upvoted a paper 10 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 191

upvoted 3 articles 10 days ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

and 1 other •

Feb 3, 2023

• 69

Article

Vision Language Models Explained

and 1 other •

Apr 11, 2024

• 393

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 463

upvoted a paper 29 days ago

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published 29 days ago • 46

upvoted a paper 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 274

upvoted 2 papers 3 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 69

Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Paper • 2503.19757 • Published Mar 25 • 51

upvoted a paper 6 months ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 39

upvoted a paper 7 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

upvoted a paper about 1 year ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

Xuehui Wang

AI & ML interests

Recent Activity

Organizations

huiserwang's activity

A Dive into Pretraining Strategies for Vision-Language Models

Vision Language Models Explained

Vision Language Models (Better, Faster, Stronger)