Open to Collab

40 80 42

Qinghong (Kevin) Lin

KevinQHLin

http://qhlin.me/

KevinQHLin
QinghongLin
kevinqhlin

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper about 20 hours ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

authored a paper 2 days ago

Egocentric Video-Language Pretraining

liked a model 4 days ago

GD-ML/Code2World

View all activity

Organizations

Articles 1

Article

When Vision Meets Code

Collections 7

View 7 collections

Papers 31

spaces 2

Paper2Poster

🚀

UniVTG

👁

models 1

KevinQHLin/VLog

Updated Mar 12, 2025

datasets 2

KevinQHLin/RICO

Preview • Updated Feb 11, 2025 • 38

KevinQHLin/ScreenSpot

Viewer • Updated Jan 1, 2025 • 1.27k • 596 • 1

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

Papers 31

spaces 2

Paper2Poster

UniVTG

models 1

KevinQHLin/VLog

datasets 2

KevinQHLin/RICO

KevinQHLin/ScreenSpot

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

ShowUI

ShowUI

Papers 31

spaces 2 Sort: Recently updated

Paper2Poster

UniVTG

models 1

datasets 2 Sort: Recently updated

spaces 2

datasets 2