Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhibin Lan's picture
8 2 3

Zhibin Lan

zhibinlan
JohnRoger's profile picture gentlebowl's profile picture dark-pen's profile picture
·

AI & ML interests

None yet

Organizations

None yet

Collections 1

LLaVE
LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets
  • zhibinlan/LLaVE-0.5B

    Image-Text-to-Text • Updated Mar 14 • 33.8k • 7
  • zhibinlan/LLaVE-2B

    Image-Text-to-Text • Updated Mar 14 • 22.4k • 45
  • zhibinlan/LLaVE-7B

    Image-Text-to-Text • Updated Mar 14 • 709 • 5
  • LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

    Paper • 2503.04812 • Published Mar 4 • 15

Papers 3

arxiv:2503.04812
arxiv:2410.04439
arxiv:2410.02745

models 5

zhibinlan/LLaVE-7B

Image-Text-to-Text • Updated Mar 14 • 709 • 5

zhibinlan/LLaVE-0.5B

Image-Text-to-Text • Updated Mar 14 • 33.8k • 7

zhibinlan/LLaVE-2B

Image-Text-to-Text • Updated Mar 14 • 22.4k • 45

zhibinlan/AVG-LLaVA

Image-Text-to-Text • Updated Oct 12, 2024 • 8 • 2

zhibinlan/AVG-LLaVA-Stage3

Image-Text-to-Text • Updated Oct 12, 2024 • 10

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs