Apriel Collection ServiceNow Language Modeling Lab's first model family series • 2 items • Updated 5 days ago • 7
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 4 days ago • 68
ColPali Models Collection Pre-trained checkpoints for the ColPali model. • 8 items • Updated Jan 23 • 5
ColQwen2 Models Collection Pre-trained checkpoints for the ColQwen2 model. • 4 items • Updated Jan 23 • 4
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 150
Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated Nov 28, 2024 • 56
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Mar 13 • 302
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 124
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 286 items • Updated Mar 18 • 26
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated 1 day ago • 228
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 210