Qwen/Qwen2.5-Coder-0.5B-Instruct Text Generation • 0.5B • Updated Nov 18, 2024 • 219k • • 66
papluca/xlm-roberta-base-language-detection Text Classification • 0.3B • Updated Dec 28, 2023 • 557k • • 375
INSAIT-Institute/BgGPT-Gemma-2-2.6B-IT-v1.0 Text Generation • 3B • Updated Dec 4, 2024 • 643 • • 7
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 168