mdocekal's picture
Update README.md
7116e96 verified
metadata
library_name: transformers
datasets:
  - SoFairOA/sofair_softcite_somesci
metrics:
  - recall
  - precision
  - f1
  - accuracy
base_model:
  - answerdotai/ModernBERT-base

SoFair ModernBERT base filter

Fine-tuned ModernBERT to identify candidate documents for software mention extraction.

It was trained on SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) to classify whether the given document contains at least one annotation.

Usage

We created https://github.com/SoFairOA/filter, a simple command-line tool to use this model for processing a collection of documents.

Evaluation

We evaluated this model on the test set of SoFairOA/sofair_softcite_somesci (sofair_softcite_somesci_documents) dataset:

precision 0.8625730994152047
recall 0.9104938271604939
f1 0.8858858858858859
accuracy 0.9268527430221367