Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 587
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 223
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 15 days ago • 238
UI is a good thing Collection cool spaces with a cool UI, what could be better? • 5 items • Updated Jun 18, 2024 • 17
On Relation-Specific Neurons in Large Language Models Paper • 2502.17355 • Published Feb 24 • 7
MMTEB Collection Our contribution to the Massive Multilingual Text Embedding Benchmark (MMTEB). Retrieval and reranking benchmarks in 16 languages. • 4 items • Updated Jun 6, 2024 • 3
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19 • 34
CommonCrawl Collection Large web-mined general corpus based on CommonCrawl. • 8 items • Updated 1 day ago • 2
NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper • 2502.05167 • Published Feb 7 • 15
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 81
How Transliterations Improve Crosslingual Alignment Paper • 2409.17326 • Published Sep 25, 2024 • 1