Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 169
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.52M • 1.64k
MuCo Collection MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model [CVPR 2026] • 4 items • Updated 29 days ago • 2
MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model Paper • 2602.06393 • Published Feb 6 • 3