view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen β’ Jan 15 β’ 187
CoLLM: A Large Language Model for Composed Image Retrieval Paper β’ 2503.19910 β’ Published Mar 25 β’ 14
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 By tomaarsen β’ May 28, 2024 β’ 225
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen β’ Mar 26 β’ 135
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14 β’ 108
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others β’ Feb 19 β’ 70
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others β’ Feb 21 β’ 165
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper β’ 2402.03300 β’ Published Feb 5, 2024 β’ 123
Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 147
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.26k
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins β’ 9 items β’ Updated 28 days ago β’ 37
view article Article Finally, a Replacement for BERT: Introducing ModernBERT By bclavie and 14 others β’ Dec 19, 2024 β’ 647
view article Article Welcome to Inference Providers on the Hub π₯ By julien-c and 6 others β’ Jan 28 β’ 483
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths β’ 3 items β’ Updated Apr 28 β’ 119
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 180
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published Jan 22 β’ 403
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper β’ 2501.11425 β’ Published Jan 20 β’ 106
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Paper β’ 2501.12273 β’ Published Jan 21 β’ 14