rohitg's Collections
Multi-modal Embedding Models

Updated Aug 9
  • Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

    Sentence Similarity • 2B • Updated Jun 9 • 23.7k • 106

  • royokong/e5-v

    Image-to-Text • 8B • Updated Oct 31, 2024 • 25.3k • 28

  • TIGER-Lab/VLM2Vec-LoRA

    Text Generation • Updated Jul 13 • 22 • 11

  • nvidia/MM-Embed

    8B • Updated Nov 6, 2024 • 988 • 61

  • MCG-NJU/CaRe-7B

    8B • Updated Mar 16 • 10 • 1

  • DeepGlint-AI/UniME-Phi3.5-V-4.2B

    Image-Text-to-Text • Updated May 6 • 548 • 7

  • DeepGlint-AI/UniME-LLaVA-1.6-7B

    Image-Text-to-Text • 8B • Updated May 6 • 481 • 5

  • BAAI/BGE-VL-base

    0.1B • Updated Mar 5 • 3.14k • 23