The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Paper • 2504.17768 • Published Apr 24 • 13
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
Aya 23: Open Weight Releases to Further Multilingual Progress Paper • 2405.15032 • Published May 23, 2024 • 32
Cohere/wikipedia-2023-11-embed-multilingual-v3-int8-binary Viewer • Updated Mar 21, 2024 • 247M • 716 • 45
Cohere/wikipedia-2023-11-embed-multilingual-v3 Viewer • Updated Mar 19, 2024 • 247M • 6.17k • 234
An overview of gradient descent optimization algorithms Paper • 1609.04747 • Published Sep 15, 2016
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9, 2024 • 57
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning Paper • 2311.11077 • Published Nov 18, 2023 • 28
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages Paper • 2305.06897 • Published May 11, 2023 • 9