Article Agentic RAG Stack (3/5) - Generate responses using a SmolLM By davidberenstein1957 • about 10 hours ago • 4
Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 10 days ago • 15
Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • 1 day ago • 6
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 14 days ago • 59
Article Train 400x faster Static Embedding Models with Sentence Transformers 23 days ago • 136
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 136
GLiREL -- Generalist Model for Zero-Shot Relation Extraction Paper • 2501.03172 • Published Jan 6 • 1
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 17
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and licensed under Apache 2.0. • 8 items • Updated Dec 18, 2024 • 52
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 42
Positions Datasets Collection Datasets where each row is a chess position • 4 items • Updated 28 days ago • 6
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 29
Tucano Collection Tucano is a series of decoder-transformers based on the Llama 2 architecture, natively pre-trained in Portuguese. • 17 items • Updated Nov 13, 2024 • 2
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 145
LLM2Encoder Collection Collection of initial models and models that use converted decoders to encoders as backbones • 11 items • Updated Sep 10, 2024 • 6
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10, 2024 • 13