1 5

Sebastian Ruder

ruder

https://ruder.io/

AI & ML interests

Natural language processing, multilingual NLP, transfer learning

Recent Activity

authored a paper about 1 month ago

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

authored a paper 8 months ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

authored a paper about 1 year ago

Aya 23: Open Weight Releases to Further Multilingual Progress

View all activity

Organizations

ruder's activity

authored a paper about 1 month ago

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

Paper • 2504.17768 • Published Apr 24 • 13

authored a paper 8 months ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20, 2024 • 12

authored a paper about 1 year ago

Aya 23: Open Weight Releases to Further Multilingual Progress

Paper • 2405.15032 • Published May 23, 2024 • 32

New activity in CohereLabs/c4ai-command-r-plus about 1 year ago

Model goes crazy if it tries to translate a large amount of Japanese text

👀 1

#9 opened about 1 year ago by

nonetrix

liked 2 models about 1 year ago

CohereLabs/c4ai-command-r-plus-4bit

Text Generation • Updated Apr 16 • 169 • 251

CohereLabs/c4ai-command-r-plus

Text Generation • Updated Apr 16 • 3.39k • • 1.73k

liked 2 datasets about 1 year ago

Cohere/wikipedia-2023-11-embed-multilingual-v3-int8-binary

Viewer • Updated Mar 21, 2024 • 247M • 716 • 45

Cohere/wikipedia-2023-11-embed-multilingual-v3

Viewer • Updated Mar 19, 2024 • 247M • 6.17k • 234

liked a model about 1 year ago

CohereLabs/c4ai-command-r-v01

Text Generation • Updated Apr 16 • 10.3k • • 1.09k

authored 3 papers over 1 year ago

An overview of gradient descent optimization algorithms

Paper • 1609.04747 • Published Sep 15, 2016

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9, 2024 • 57

Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning

Paper • 2311.11077 • Published Nov 18, 2023 • 28

authored a paper almost 2 years ago

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

Paper • 2305.06897 • Published May 11, 2023 • 9

authored a paper about 2 years ago

PaLM 2 Technical Report

Paper • 2305.10403 • Published May 17, 2023 • 6