Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 • 70
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 3 days ago • 98
view article Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo • 7 days ago • 22
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 57
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 15
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw • 11 days ago • 22
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published 16 days ago • 11
PubMedBERT Embeddings M2V Collection Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants. • 5 items • Updated 10 days ago • 3
ModernGLiNER Collection GLiNER models based on modern encoder architectures • 2 items • Updated 25 days ago • 6
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 19 days ago • 23
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated about 1 month ago • 48
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 132
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published about 1 month ago • 123
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 30 days ago • 123
view article Article Building a Local Vector Database Index with Annoy and Sentence Transformers By theeseus-ai • Dec 5, 2024 • 3