Benjamin Clavié's picture

Benjamin Clavié

bclavie

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago
sbintuitions/modernbert-ja-130m
updated a model 12 days ago
answerdotai/ModernBERT-Large-Instruct
published a model 16 days ago
answerdotai/ModernBERT-Large-Instruct
View all activity

Organizations

Answer.AI's profile picture Bert ... but new's profile picture

bclavie's activity

New activity in Alibaba-NLP/gte-modernbert-base about 1 month ago

COIR Repro

2
#3 opened about 1 month ago by
bclavie
replied to MoritzLaurer's post about 2 months ago
view reply

On entailment adjacent tasks (which btw, great work on the zero-shot NLI models @MoritzLaurer !), I'd expect DeBERTa to be slightly better than ModernBERT -- it seems its pretraining objective is better aligned with it. In our evals, we consistently had DeBERTa come on top on MNLI (there's a full GLUE table in the appendix of the paper), it's only on aggregated GLUE that we saw ModernBERT-Base beat DeBERTaV3-Base.