L-Hongbin's Collections
MutiModal_Dataset
WildVision/wildvision-chat • 45.2k • 171 • 20
lmms-lab/LLaVA-Video-178K • 1.63M • 18.9k • 113
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Paper • 2409.04429
JefferyZhan/Language-prompted-Localization-Dataset • 130 • 3
mlfoundations/MINT-1T-HTML • 623M • 281k • 82
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Paper • 2411.14347 • 13
Salesforce/blip3-grounding-50m • 52.4M • 758 • 20