62 186 657

Gabriele Sarti PRO

gsarti

yjernite's profile picture

wassemgtk's profile picture

admarcosai's profile picture

https://gsarti.com

gsarti_
gsarti
gabrielesarti
gsarti.com

AI & ML interests

Interpretability for generative language models

Recent Activity

liked a model 2 days ago

chandar-lab/NeoBERT

updated a collection 5 days ago

🇮🇹 Italian NLP Resources

liked a model 5 days ago

Fastweb/FastwebMIIA-7B

View all activity

Organizations

gsarti 's collections 7

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 25 days ago • 164
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

Paper • 2505.23183 • Published 30 days ago • 2
Improved Representation Steering for Language Models

Paper • 2505.20809 • Published May 27 • 1
SAEs Are Good for Steering -- If You Select the Right Features

Paper • 2505.20063 • Published May 26 • 1

🇮🇹 IT5 @ LREC/COLING 2024

Materials for the paper "IT5:Text-to-text Pretraining for Italian Language Understanding and Generation" published at LREC/COLING 2024

IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation

Paper • 2203.03759 • Published Mar 7, 2022 • 5
Running

6

6

IT5 Demo

🤌

Test fine-tuned IT5 models for Italian language generation
gsarti/itagen

Preview • Updated Apr 26, 2022 • 9
gsarti/clean_mc4_it

Updated Jun 17, 2024 • 334 • 15

✍️ QE4PE & GroTE

Materials for "QE4PE: Word-level Quality Estimation for Human Post-Editing"

QE4PE: Word-level Quality Estimation for Human Post-Editing

Paper • 2503.03044 • Published Mar 4 • 6
gsarti/qe4pe

Viewer • Updated Apr 30 • 12.2k • 918 • 4
Running

1

1

GroTE

🐮

Post-editing Interface with Highlight Support

🧩 Verbalized Rebus @ CLiC-it 2024

Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses"

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1, 2024 • 7
gsarti/eureka-rebus

Viewer • Updated Mar 20 • 307k • 125 • 1
gsarti/eureka-rebus-calamita-2024

Viewer • Updated Mar 31 • 83.3k • 17 • 1
Kamyar-zeinalipour/ITA_CW

Viewer • Updated Oct 19, 2023 • 125k • 12 • 1

🐑🐑 PECoRe @ ICLR 2024

Resources for the paper "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" (Sarti et al. 2024) published in ICLR 2024

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

Paper • 2310.01188 • Published Oct 2, 2023 • 1
Runtime error

15

15

PECoRe

🐑

Analyze context usage in LM generations with model internals
gsarti/iwslt2017_context

Viewer • Updated May 7, 2023 • 5.55M • 189 • 1
inseq/scat

Updated Mar 10, 2024 • 126 • 1

🇮🇹 Italian NLP Resources

Collection of models, datasets and demos relevant to Italian NLP 🇮🇹

gsarti/it5-base

Text2Text Generation • Updated Jun 17, 2024 • 494 • 24
z-uo/squad-it

Viewer • Updated Oct 25, 2022 • 2 • 113 • 1
gsarti/clean_mc4_it

Updated Jun 17, 2024 • 334 • 15
gsarti/itacola

Updated Jul 1, 2022 • 128 • 2

🧩 Word games

A collection of resources for word games in various languages

artemsnegirev/ru-word-games

Viewer • Updated Apr 29, 2023 • 133k • 51 • 6
albertxu/CrosswordQA

Viewer • Updated Oct 29, 2022 • 6.78M • 130 • 6
anishthalamati/nyt-connections

Viewer • Updated Mar 29, 2024 • 915 • 18 • 2
Kamyar-zeinalipour/ITA_CW

Viewer • Updated Oct 19, 2023 • 125k • 12 • 1

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 25 days ago • 164
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

Paper • 2505.23183 • Published 30 days ago • 2
Improved Representation Steering for Language Models

Paper • 2505.20809 • Published May 27 • 1
SAEs Are Good for Steering -- If You Select the Right Features

Paper • 2505.20063 • Published May 26 • 1

🐑🐑 PECoRe @ ICLR 2024

Resources for the paper "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" (Sarti et al. 2024) published in ICLR 2024

Quantifying the Plausibility of Context Reliance in Neural Machine Translation

Paper • 2310.01188 • Published Oct 2, 2023 • 1
Runtime error

15

15

PECoRe

🐑

Analyze context usage in LM generations with model internals
gsarti/iwslt2017_context

Viewer • Updated May 7, 2023 • 5.55M • 189 • 1
inseq/scat

Updated Mar 10, 2024 • 126 • 1

🇮🇹 IT5 @ LREC/COLING 2024

Materials for the paper "IT5:Text-to-text Pretraining for Italian Language Understanding and Generation" published at LREC/COLING 2024

IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation

Paper • 2203.03759 • Published Mar 7, 2022 • 5
Running

6

6

IT5 Demo

🤌

Test fine-tuned IT5 models for Italian language generation
gsarti/itagen

Preview • Updated Apr 26, 2022 • 9
gsarti/clean_mc4_it

Updated Jun 17, 2024 • 334 • 15

🇮🇹 Italian NLP Resources

Collection of models, datasets and demos relevant to Italian NLP 🇮🇹

gsarti/it5-base

Text2Text Generation • Updated Jun 17, 2024 • 494 • 24
z-uo/squad-it

Viewer • Updated Oct 25, 2022 • 2 • 113 • 1
gsarti/clean_mc4_it

Updated Jun 17, 2024 • 334 • 15
gsarti/itacola

Updated Jul 1, 2022 • 128 • 2

✍️ QE4PE & GroTE

Materials for "QE4PE: Word-level Quality Estimation for Human Post-Editing"

QE4PE: Word-level Quality Estimation for Human Post-Editing

Paper • 2503.03044 • Published Mar 4 • 6
gsarti/qe4pe

Viewer • Updated Apr 30 • 12.2k • 918 • 4
Running

1

1

GroTE

🐮

Post-editing Interface with Highlight Support

🧩 Word games

A collection of resources for word games in various languages

artemsnegirev/ru-word-games

Viewer • Updated Apr 29, 2023 • 133k • 51 • 6
albertxu/CrosswordQA

Viewer • Updated Oct 29, 2022 • 6.78M • 130 • 6
anishthalamati/nyt-connections

Viewer • Updated Mar 29, 2024 • 915 • 18 • 2
Kamyar-zeinalipour/ITA_CW

Viewer • Updated Oct 19, 2023 • 125k • 12 • 1

🧩 Verbalized Rebus @ CLiC-it 2024

Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses"

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1, 2024 • 7
gsarti/eureka-rebus

Viewer • Updated Mar 20 • 307k • 125 • 1
gsarti/eureka-rebus-calamita-2024

Viewer • Updated Mar 31 • 83.3k • 17 • 1
Kamyar-zeinalipour/ITA_CW

Viewer • Updated Oct 19, 2023 • 125k • 12 • 1

Gabriele Sarti PRO

AI & ML interests

Recent Activity

Organizations

gsarti 's collections 7

IT5 Demo

GroTE

PECoRe

PECoRe

IT5 Demo

GroTE