arxiv:2501.05122
Flo Schneider
floschne
AI & ML interests
Large Vision-Language Models, Cross-modal Retrieval
Recent Activity
liked a model 1 day ago: Gregor/mblip-mt0-xl
liked a model 2 days ago: WueNLP/centurio_aya
authored a paper 6 days ago: Why do LLaVA Vision-Language Models Reply to Images in English?
Organizations
models
None public yet
datasets
14
floschne/wismir3 • Viewer • 301k • 102
floschne/xflickrco_1k • Viewer • 8k • 42 • 1
floschne/xflickrco • Viewer • 16k • 63 • 1
floschne/xgqa_1k • Viewer • 8k • 34
floschne/xvnli • Viewer • 5.82k • 30
floschne/xgqa • Viewer • 77.3k • 93
floschne/xm3600_1k • 88
floschne/xm3600 • 50 • 5
floschne/m5b_vlod • Viewer • 1.42k • 29
floschne/m5b_vgr • Viewer • 1.43k • 29