view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ 22 days ago β’ 110
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published Feb 16 β’ 159
view article Article MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before By isaacchung and 2 others β’ Apr 24 β’ 14
Running on CPU Upgrade 1.34k 1.34k C4AI Command Models π Start a chat to get answers and explanations from a language model
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper β’ 2405.07920 β’ Published May 13, 2024 β’ 2
EuroBERT Collection Scaling Multilingual Encoders for European Languages β’ 4 items β’ Updated Mar 10 β’ 11
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 282
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others β’ Mar 10 β’ 144
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others β’ Mar 4 β’ 74
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 8 items β’ Updated Mar 3 β’ 25