How to use ModernBERT with the AutoModelForQuestionAnswering class?

#15

by sraj - opened Dec 23, 2024

sraj

Dec 23, 2024

I wanted to fine tune the model on the SQUAD dataset. But currently AutoModelForQuestionAnswering does not work with ModernBERT. Is there a workaround for the moment where I can train ModernBERT for QA tasks given a context?

bclavie

Answer.AI org Dec 23, 2024

Hey! This is planned, but indeed missing at the moment.

cc @NohTow @bwarner @wgpubs @tomaarsen , we should ensure our January update includes the missing heads (QA, MultipleChoice)

AkimfromParis

Jan 7

Thank you very much for ModernBERT!
@bclavie It will be amazing to have TokenClassification also. : )

saattrupdan

Jan 12

•

edited Jan 12

@bclavie @NohTow @bwarner @wgpubs @tomaarsen Do any of you have an update on this?

saattrupdan

Feb 17

@bclavie @NohTow @bwarner @wgpubs @tomaarsen Can you check if this script implements the ModernBertForQuestionAnswering correctly, and if so, copy the script into your repo here and the large version? Then we would be able to use the models for that if we enable trust_remote_code 🙂

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment