focal_modernbert_punctuation_128_v3

This model is a fine-tuned version of answerdotai/ModernBERT-large on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0188
  • Accuracy: 0.9791
  • Precision O: 0.9900
  • Recall O: 0.9922
  • F1 O: 0.9911
  • Precision Comma: 0.8448
  • Recall Comma: 0.8192
  • F1 Comma: 0.8318
  • Precision Period: 0.9060
  • Recall Period: 0.8964
  • F1 Period: 0.9011
  • Precision Question: 0.8412
  • Recall Question: 0.8171
  • F1 Question: 0.8290
  • Precision Exclamation: 0.0
  • Recall Exclamation: 0.0
  • F1 Exclamation: 0.0
  • Precision Macro: 0.8955
  • Recall Macro: 0.8812
  • F1 Macro: 0.8883

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 128
  • eval_batch_size: 16
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 8
  • total_train_batch_size: 1024
  • total_eval_batch_size: 128
  • optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_steps: 100
  • num_epochs: 3

Training results

Training Loss Epoch Step Validation Loss Accuracy Precision O Recall O F1 O Precision Comma Recall Comma F1 Comma Precision Period Recall Period F1 Period Precision Question Recall Question F1 Question Precision Exclamation Recall Exclamation F1 Exclamation Precision Macro Recall Macro F1 Macro
0.0691 0.0291 100 0.0554 0.9472 0.9647 0.9855 0.9750 0.6843 0.4120 0.5144 0.7683 0.7905 0.7793 0.6471 0.44 0.5238 0.0 0.0 0.0 0.7661 0.6570 0.6981
0.0352 0.0582 200 0.0336 0.9661 0.9827 0.9878 0.9853 0.7387 0.7133 0.7258 0.8767 0.8151 0.8448 0.7428 0.7343 0.7385 0.0 0.0 0.0 0.8352 0.8126 0.8236
0.0298 0.0872 300 0.0291 0.9699 0.9827 0.9910 0.9868 0.7838 0.7230 0.7522 0.9003 0.8254 0.8612 0.7944 0.7286 0.7601 0.0 0.0 0.0 0.8653 0.8170 0.8401
0.0278 0.1163 400 0.0268 0.9715 0.9847 0.9902 0.9875 0.8139 0.7232 0.7659 0.8622 0.8796 0.8708 0.8203 0.7171 0.7652 0.0 0.0 0.0 0.8703 0.8275 0.8473
0.0265 0.1454 500 0.0255 0.9723 0.9849 0.9909 0.9879 0.8109 0.7448 0.7764 0.8824 0.8588 0.8705 0.8185 0.7343 0.7741 0.0 0.0 0.0 0.8742 0.8322 0.8522
0.0259 0.1745 600 0.0248 0.9730 0.9868 0.9898 0.9883 0.7960 0.7727 0.7842 0.8866 0.8645 0.8754 0.7944 0.7286 0.7601 0.0 0.0 0.0 0.8660 0.8389 0.8520
0.0255 0.2035 700 0.0244 0.9730 0.9858 0.9910 0.9884 0.8392 0.7209 0.7756 0.8515 0.9024 0.8762 0.7950 0.72 0.7556 0.0 0.0 0.0 0.8679 0.8336 0.8489
0.0239 0.2326 800 0.0236 0.9739 0.9846 0.9926 0.9886 0.8433 0.7263 0.7804 0.8806 0.8840 0.8823 0.8638 0.6886 0.7663 0.0 0.0 0.0 0.8931 0.8228 0.8544
0.0248 0.2617 900 0.0237 0.9736 0.9859 0.9913 0.9886 0.8302 0.7356 0.7801 0.8693 0.8903 0.8797 0.8153 0.7314 0.7711 0.0 0.0 0.0 0.8752 0.8372 0.8549
0.0235 0.2908 1000 0.0231 0.9740 0.9881 0.9896 0.9889 0.8170 0.7641 0.7897 0.8610 0.9 0.8801 0.7907 0.7771 0.7839 0.0 0.0 0.0 0.8642 0.8577 0.8606
0.0233 0.3199 1100 0.0225 0.9747 0.9890 0.9892 0.9891 0.7888 0.8137 0.8011 0.8966 0.8723 0.8842 0.8769 0.6714 0.7605 0.0 0.0 0.0 0.8878 0.8366 0.8587
0.023 0.3489 1200 0.0220 0.9752 0.9874 0.9911 0.9893 0.8251 0.7806 0.8022 0.8899 0.8810 0.8854 0.8032 0.7114 0.7545 0.0 0.0 0.0 0.8764 0.8410 0.8579
0.0228 0.3780 1300 0.0224 0.9747 0.9903 0.9879 0.9891 0.7874 0.8212 0.8040 0.8853 0.8840 0.8846 0.7686 0.7971 0.7826 0.0 0.0 0.0 0.8579 0.8726 0.8651
0.0227 0.4071 1400 0.0220 0.9752 0.9869 0.9916 0.9893 0.8254 0.7799 0.8020 0.8999 0.8666 0.8829 0.7549 0.7657 0.7603 0.0 0.0 0.0 0.8668 0.8510 0.8586
0.0226 0.4362 1500 0.0220 0.9749 0.9872 0.9911 0.9892 0.8095 0.7854 0.7973 0.9043 0.8642 0.8838 0.8259 0.7457 0.7838 0.0 0.0 0.0 0.8817 0.8466 0.8635
0.0223 0.4653 1600 0.0218 0.9755 0.9883 0.9907 0.9895 0.8236 0.7835 0.8031 0.8840 0.8897 0.8868 0.7794 0.7571 0.7681 0.0 0.0 0.0 0.8688 0.8553 0.8619
0.0225 0.4943 1700 0.0213 0.9756 0.9870 0.9920 0.9895 0.8336 0.7720 0.8016 0.8936 0.8815 0.8875 0.8515 0.7371 0.7902 0.0 0.0 0.0 0.8914 0.8456 0.8672
0.0218 0.5234 1800 0.0211 0.9759 0.9879 0.9915 0.9897 0.8160 0.7960 0.8059 0.9068 0.8700 0.8881 0.8416 0.7286 0.7810 0.0 0.0 0.0 0.8881 0.8465 0.8662
0.0218 0.5525 1900 0.0207 0.9762 0.9873 0.9923 0.9898 0.8390 0.7737 0.8051 0.8938 0.8854 0.8896 0.8113 0.7371 0.7725 0.0 0.0 0.0 0.8829 0.8471 0.8642
0.0216 0.5816 2000 0.0207 0.9762 0.9894 0.9902 0.9898 0.8136 0.8082 0.8109 0.8922 0.8883 0.8902 0.8176 0.7429 0.7784 0.0 0.0 0.0 0.8782 0.8574 0.8673
0.0215 0.6106 2100 0.0208 0.9763 0.9872 0.9926 0.9899 0.8491 0.7642 0.8044 0.8897 0.8899 0.8898 0.7778 0.78 0.7789 0.0 0.0 0.0 0.8759 0.8567 0.8657
0.0216 0.6397 2200 0.0208 0.9761 0.9865 0.9930 0.9897 0.8354 0.7747 0.8039 0.9125 0.8645 0.8879 0.8226 0.7686 0.7947 0.0 0.0 0.0 0.8893 0.8502 0.8690
0.0212 0.6688 2300 0.0207 0.9761 0.9869 0.9924 0.9897 0.8440 0.7669 0.8036 0.8963 0.8864 0.8913 0.7670 0.7714 0.7692 0.0 0.0 0.0 0.8736 0.8543 0.8635
0.0213 0.6979 2400 0.0203 0.9761 0.9884 0.9913 0.9899 0.8491 0.7658 0.8053 0.8627 0.9144 0.8878 0.8482 0.7343 0.7871 0.0 0.0 0.0 0.8871 0.8515 0.8675
0.0212 0.7270 2500 0.0202 0.9768 0.9902 0.9898 0.9900 0.8145 0.8209 0.8177 0.8945 0.8933 0.8939 0.7954 0.7886 0.7920 0.0 0.0 0.0 0.8736 0.8732 0.8734
0.0213 0.7560 2600 0.0200 0.9769 0.9879 0.9924 0.9901 0.8502 0.7759 0.8114 0.8866 0.8975 0.8920 0.8312 0.7457 0.7861 0.0 0.0 0.0 0.8890 0.8529 0.8699
0.0208 0.7851 2700 0.0199 0.9770 0.9871 0.9930 0.9901 0.8471 0.7771 0.8106 0.9040 0.8864 0.8951 0.8723 0.7029 0.7785 0.0 0.0 0.0 0.9026 0.8398 0.8686
0.0207 0.8142 2800 0.0197 0.9770 0.9879 0.9925 0.9902 0.8498 0.7764 0.8115 0.8889 0.8952 0.8921 0.8272 0.7657 0.7953 0.0 0.0 0.0 0.8884 0.8575 0.8722
0.0203 0.8433 2900 0.0197 0.9769 0.9880 0.9922 0.9901 0.8401 0.7886 0.8136 0.8962 0.8865 0.8913 0.8317 0.7343 0.7800 0.0 0.0 0.0 0.8890 0.8504 0.8687
0.0205 0.8723 3000 0.0197 0.9768 0.9871 0.9929 0.9900 0.8510 0.7701 0.8086 0.8981 0.8900 0.8941 0.8090 0.7743 0.7912 0.0 0.0 0.0 0.8863 0.8568 0.8710
0.0206 0.9014 3100 0.0196 0.9771 0.9885 0.9917 0.9901 0.8345 0.7990 0.8164 0.8992 0.8894 0.8943 0.8388 0.7286 0.7798 0.0 0.0 0.0 0.8903 0.8522 0.8701
0.0206 0.9305 3200 0.0196 0.9769 0.9874 0.9927 0.9900 0.8490 0.7751 0.8103 0.8970 0.8919 0.8945 0.8354 0.7543 0.7928 0.0 0.0 0.0 0.8922 0.8535 0.8719
0.0206 0.9596 3300 0.0199 0.9765 0.9908 0.9893 0.9901 0.7870 0.8513 0.8179 0.9198 0.8597 0.8888 0.7815 0.7971 0.7893 0.0 0.0 0.0 0.8698 0.8744 0.8715
0.0203 0.9887 3400 0.0194 0.9774 0.9889 0.9916 0.9903 0.8494 0.7842 0.8155 0.8825 0.9103 0.8962 0.8102 0.8171 0.8137 0.0 0.0 0.0 0.8827 0.8758 0.8789
0.0184 1.0177 3500 0.0194 0.9774 0.9893 0.9913 0.9903 0.8403 0.7976 0.8184 0.8855 0.9057 0.8955 0.8471 0.76 0.8012 0.0 0.0 0.0 0.8906 0.8636 0.8763
0.0182 1.0468 3600 0.0196 0.9777 0.9890 0.9919 0.9904 0.8480 0.7922 0.8192 0.8888 0.9033 0.8960 0.8182 0.8229 0.8205 0.0 0.0 0.0 0.8860 0.8776 0.8815
0.0184 1.0759 3700 0.0194 0.9772 0.9901 0.9905 0.9903 0.8177 0.8228 0.8202 0.8948 0.8926 0.8937 0.8782 0.68 0.7665 0.0 0.0 0.0 0.8952 0.8465 0.8677
0.0182 1.1050 3800 0.0195 0.9775 0.9871 0.9935 0.9903 0.8622 0.7668 0.8117 0.9003 0.8930 0.8967 0.8132 0.8086 0.8109 0.0 0.0 0.0 0.8907 0.8655 0.8774
0.0186 1.1341 3900 0.0194 0.9778 0.9885 0.9925 0.9905 0.8465 0.7952 0.8201 0.9031 0.8894 0.8962 0.8171 0.8171 0.8171 0.0 0.0 0.0 0.8888 0.8736 0.8810
0.0185 1.1631 4000 0.0193 0.9778 0.9899 0.9912 0.9905 0.8244 0.8227 0.8235 0.9095 0.8835 0.8963 0.7949 0.8086 0.8017 0.0 0.0 0.0 0.8797 0.8765 0.8780
0.0184 1.1922 4100 0.0193 0.9774 0.9896 0.9912 0.9904 0.8292 0.8086 0.8188 0.8930 0.8941 0.8936 0.8479 0.7486 0.7951 0.0 0.0 0.0 0.8899 0.8606 0.8745
0.0185 1.2213 4200 0.0193 0.9776 0.9885 0.9923 0.9904 0.8403 0.7968 0.8180 0.9050 0.8887 0.8968 0.8287 0.7743 0.8006 0.0 0.0 0.0 0.8906 0.8630 0.8764
0.0182 1.2504 4300 0.0194 0.9774 0.9881 0.9924 0.9903 0.8476 0.7941 0.8200 0.9024 0.8834 0.8928 0.7793 0.8171 0.7978 0.0 0.0 0.0 0.8793 0.8718 0.8752
0.0185 1.2794 4400 0.0194 0.9779 0.9897 0.9915 0.9906 0.8278 0.8201 0.8240 0.9097 0.8834 0.8964 0.8011 0.8057 0.8034 0.0 0.0 0.0 0.8821 0.8752 0.8786
0.0179 1.3085 4500 0.0191 0.9775 0.9888 0.9921 0.9905 0.8354 0.8016 0.8182 0.9031 0.8830 0.8929 0.8216 0.8029 0.8121 0.0 0.0 0.0 0.8872 0.8699 0.8784
0.0184 1.3376 4600 0.0192 0.9779 0.9896 0.9915 0.9906 0.8380 0.8075 0.8225 0.8979 0.8973 0.8976 0.7817 0.8286 0.8044 0.0 0.0 0.0 0.8768 0.8812 0.8788
0.0187 1.3667 4700 0.0193 0.9776 0.9889 0.9919 0.9904 0.8334 0.8061 0.8195 0.9057 0.8859 0.8957 0.8471 0.76 0.8012 0.0 0.0 0.0 0.8938 0.8610 0.8767
0.0181 1.3958 4800 0.0192 0.9779 0.9892 0.9921 0.9907 0.8318 0.8122 0.8219 0.9096 0.8818 0.8955 0.8452 0.78 0.8113 0.0 0.0 0.0 0.8939 0.8665 0.8798
0.0178 1.4248 4900 0.0190 0.9780 0.9903 0.9911 0.9907 0.8259 0.8248 0.8254 0.9033 0.8892 0.8962 0.8300 0.8229 0.8264 0.0 0.0 0.0 0.8874 0.8820 0.8847
0.0183 1.4539 5000 0.0190 0.9776 0.9880 0.9930 0.9905 0.8529 0.7817 0.8157 0.8987 0.8938 0.8962 0.8232 0.7714 0.7965 0.0 0.0 0.0 0.8907 0.8600 0.8747
0.0186 1.4830 5100 0.0188 0.9779 0.9895 0.9918 0.9907 0.8356 0.8108 0.8230 0.9010 0.8897 0.8953 0.8142 0.7886 0.8012 0.0 0.0 0.0 0.8851 0.8702 0.8775
0.0182 1.5121 5200 0.0189 0.9775 0.9881 0.9928 0.9904 0.8421 0.7896 0.8150 0.9057 0.8859 0.8957 0.8632 0.7571 0.8067 0.0 0.0 0.0 0.8998 0.8563 0.8769
0.018 1.5411 5300 0.0186 0.9779 0.9888 0.9923 0.9906 0.8488 0.7943 0.8206 0.8974 0.8981 0.8977 0.8109 0.8086 0.8097 0.0 0.0 0.0 0.8865 0.8733 0.8797
0.0178 1.5702 5400 0.0187 0.9781 0.9897 0.9919 0.9908 0.8385 0.8093 0.8236 0.8965 0.8960 0.8963 0.8476 0.7629 0.8030 0.0 0.0 0.0 0.8931 0.8650 0.8784
0.0182 1.5993 5500 0.0186 0.9778 0.9886 0.9925 0.9906 0.8442 0.7987 0.8208 0.9005 0.8897 0.8951 0.8557 0.7286 0.7870 0.0 0.0 0.0 0.8973 0.8524 0.8734
0.0176 1.6284 5600 0.0186 0.9779 0.9905 0.9907 0.9906 0.8171 0.8325 0.8247 0.9088 0.8875 0.8980 0.8269 0.7914 0.8088 0.0 0.0 0.0 0.8858 0.8755 0.8805
0.0177 1.6575 5700 0.0186 0.9783 0.9896 0.9920 0.9908 0.8399 0.8079 0.8236 0.9022 0.8975 0.8998 0.8328 0.7971 0.8146 0.0 0.0 0.0 0.8911 0.8736 0.8822
0.0176 1.6865 5800 0.0184 0.9783 0.9888 0.9926 0.9907 0.8531 0.7976 0.8244 0.8999 0.8960 0.8980 0.8285 0.8143 0.8213 0.0 0.0 0.0 0.8926 0.8751 0.8836
0.0177 1.7156 5900 0.0186 0.9785 0.9892 0.9926 0.9909 0.8414 0.8091 0.8250 0.9130 0.8883 0.9005 0.8271 0.82 0.8235 0.0 0.0 0.0 0.8927 0.8775 0.8850
0.0178 1.7447 6000 0.0183 0.9783 0.9909 0.9906 0.9908 0.8207 0.8373 0.8289 0.9049 0.8930 0.8989 0.8466 0.7886 0.8166 0.0 0.0 0.0 0.8908 0.8774 0.8838
0.0176 1.7738 6100 0.0184 0.9784 0.9899 0.9917 0.9908 0.8386 0.8180 0.8282 0.9007 0.8970 0.8988 0.8680 0.7514 0.8055 0.0 0.0 0.0 0.8993 0.8645 0.8808
0.0177 1.8028 6200 0.0181 0.9786 0.9890 0.9928 0.9909 0.8439 0.8124 0.8279 0.9145 0.8845 0.8992 0.8431 0.7829 0.8119 0.0 0.0 0.0 0.8976 0.8681 0.8825
0.0177 1.8319 6300 0.0182 0.9785 0.9891 0.9927 0.9909 0.8475 0.8038 0.8251 0.9090 0.8908 0.8998 0.7967 0.8286 0.8123 0.0 0.0 0.0 0.8856 0.8790 0.8820
0.0178 1.8610 6400 0.0184 0.9785 0.9889 0.9928 0.9909 0.8459 0.8085 0.8268 0.9100 0.8848 0.8972 0.8273 0.78 0.8029 0.0 0.0 0.0 0.8930 0.8665 0.8795
0.0178 1.8901 6500 0.0183 0.9784 0.9901 0.9916 0.9908 0.8397 0.8148 0.8270 0.8955 0.9022 0.8989 0.8395 0.7771 0.8071 0.0 0.0 0.0 0.8912 0.8714 0.8810
0.0175 1.9192 6600 0.0183 0.9785 0.9882 0.9934 0.9908 0.8531 0.7957 0.8234 0.9140 0.8887 0.9012 0.8567 0.7514 0.8006 0.0 0.0 0.0 0.9030 0.8573 0.8790
0.0178 1.9482 6700 0.0180 0.9788 0.9896 0.9922 0.9909 0.8417 0.8179 0.8296 0.9095 0.8940 0.9017 0.8508 0.7657 0.8060 0.0 0.0 0.0 0.8979 0.8674 0.8821
0.0175 1.9773 6800 0.0181 0.9786 0.9902 0.9916 0.9909 0.8307 0.8290 0.8298 0.9094 0.8911 0.9002 0.8489 0.7543 0.7988 0.0 0.0 0.0 0.8948 0.8665 0.8799
0.0152 2.0064 6900 0.0186 0.9787 0.9888 0.9929 0.9909 0.8550 0.7981 0.8256 0.9059 0.8968 0.9013 0.8094 0.8371 0.8230 0.0 0.0 0.0 0.8898 0.8812 0.8852
0.0152 2.0355 7000 0.0188 0.9784 0.9894 0.9922 0.9908 0.8420 0.8090 0.8252 0.9014 0.8967 0.8990 0.8656 0.7543 0.8061 0.0 0.0 0.0 0.8996 0.8630 0.8803
0.0151 2.0646 7100 0.0190 0.9784 0.9904 0.9913 0.9909 0.8249 0.8321 0.8285 0.9108 0.8854 0.8979 0.8328 0.8114 0.8220 0.0 0.0 0.0 0.8897 0.8801 0.8848
0.0152 2.0936 7200 0.0189 0.9786 0.9899 0.9919 0.9909 0.8353 0.8184 0.8268 0.9097 0.8913 0.9004 0.8164 0.8257 0.8210 0.0 0.0 0.0 0.8878 0.8818 0.8848
0.0152 2.1227 7300 0.0188 0.9785 0.9901 0.9916 0.9909 0.8333 0.8219 0.8275 0.9070 0.8930 0.8999 0.8256 0.8114 0.8184 0.0 0.0 0.0 0.8890 0.8795 0.8842
0.015 2.1518 7400 0.0192 0.9786 0.9893 0.9926 0.9909 0.8490 0.8039 0.8258 0.9040 0.8938 0.8989 0.8101 0.8286 0.8192 0.0 0.0 0.0 0.8881 0.8797 0.8837
0.0149 2.1809 7500 0.0189 0.9785 0.9896 0.9920 0.9908 0.8444 0.8085 0.8260 0.9012 0.9002 0.9007 0.8251 0.8086 0.8167 0.0 0.0 0.0 0.8900 0.8773 0.8836
0.0148 2.2099 7600 0.0190 0.9785 0.9902 0.9917 0.9909 0.8356 0.8195 0.8275 0.9018 0.8984 0.9001 0.8558 0.7629 0.8066 0.0 0.0 0.0 0.8958 0.8681 0.8813
0.0148 2.2390 7700 0.0189 0.9788 0.9902 0.9918 0.9910 0.8441 0.8164 0.8300 0.8982 0.9019 0.9000 0.8257 0.8257 0.8257 0.0 0.0 0.0 0.8896 0.8839 0.8867
0.0151 2.2681 7800 0.0189 0.9787 0.9902 0.9917 0.9910 0.8445 0.8133 0.8286 0.8941 0.9049 0.8995 0.8247 0.82 0.8223 0.0 0.0 0.0 0.8884 0.8825 0.8853
0.0149 2.2972 7900 0.0190 0.9787 0.9906 0.9912 0.9909 0.8334 0.8290 0.8312 0.9033 0.8946 0.8990 0.8049 0.8486 0.8261 0.0 0.0 0.0 0.8831 0.8909 0.8868
0.015 2.3263 8000 0.0188 0.9788 0.9895 0.9925 0.9910 0.8475 0.8097 0.8282 0.9030 0.8970 0.9000 0.8612 0.78 0.8186 0.0 0.0 0.0 0.9003 0.8698 0.8844
0.0149 2.3553 8100 0.0189 0.9787 0.9905 0.9913 0.9909 0.8278 0.8349 0.8313 0.9131 0.8889 0.9008 0.8353 0.8257 0.8305 0.0 0.0 0.0 0.8917 0.8852 0.8884
0.0148 2.3844 8200 0.0190 0.9787 0.9906 0.9913 0.9910 0.8297 0.8311 0.8304 0.9062 0.8946 0.9004 0.8503 0.8114 0.8304 0.0 0.0 0.0 0.8942 0.8821 0.8880
0.0146 2.4135 8300 0.0188 0.9788 0.9901 0.9918 0.9910 0.8407 0.8214 0.8309 0.9033 0.8984 0.9008 0.8502 0.7943 0.8213 0.0 0.0 0.0 0.8961 0.8765 0.8860
0.0145 2.4426 8400 0.0189 0.9787 0.9902 0.9917 0.9909 0.8386 0.8203 0.8293 0.9019 0.8986 0.9002 0.8464 0.8029 0.8240 0.0 0.0 0.0 0.8943 0.8783 0.8861
0.0151 2.4716 8500 0.0188 0.9789 0.9899 0.9922 0.9910 0.8442 0.8171 0.8304 0.9045 0.8975 0.9010 0.8498 0.8086 0.8287 0.0 0.0 0.0 0.8971 0.8788 0.8878
0.0146 2.5007 8600 0.0190 0.9788 0.9896 0.9923 0.9909 0.8463 0.8114 0.8285 0.9044 0.8976 0.9010 0.8448 0.8086 0.8263 0.0 0.0 0.0 0.8962 0.8775 0.8867
0.0147 2.5298 8700 0.0189 0.9788 0.9899 0.9921 0.9910 0.8401 0.8207 0.8303 0.9073 0.8930 0.9001 0.8466 0.7886 0.8166 0.0 0.0 0.0 0.8960 0.8736 0.8845
0.0149 2.5589 8800 0.0188 0.9789 0.9898 0.9923 0.9911 0.8424 0.8195 0.8308 0.9086 0.8913 0.8998 0.8507 0.8143 0.8321 0.0 0.0 0.0 0.8979 0.8793 0.8885
0.0146 2.5880 8900 0.0190 0.9790 0.9899 0.9923 0.9911 0.8442 0.8180 0.8309 0.9085 0.8943 0.9014 0.8305 0.8257 0.8281 0.0 0.0 0.0 0.8933 0.8826 0.8879
0.0148 2.6170 9000 0.0189 0.9791 0.9902 0.9921 0.9911 0.8425 0.8224 0.8324 0.9063 0.8967 0.9015 0.8343 0.82 0.8271 0.0 0.0 0.0 0.8933 0.8828 0.8880
0.0147 2.6461 9100 0.0189 0.9791 0.9899 0.9923 0.9911 0.8455 0.8188 0.8319 0.9076 0.8967 0.9021 0.8290 0.8171 0.8230 0.0 0.0 0.0 0.8930 0.8812 0.8870
0.0147 2.6752 9200 0.0189 0.9790 0.9899 0.9923 0.9911 0.8451 0.8179 0.8312 0.9071 0.8962 0.9016 0.8448 0.8086 0.8263 0.0 0.0 0.0 0.8967 0.8787 0.8876
0.0146 2.7043 9300 0.0188 0.9790 0.9897 0.9925 0.9911 0.8463 0.8136 0.8296 0.9062 0.8968 0.9015 0.8580 0.7943 0.8249 0.0 0.0 0.0 0.9000 0.8743 0.8868
0.0146 2.7334 9400 0.0188 0.9791 0.9901 0.9922 0.9911 0.8427 0.8235 0.8330 0.9092 0.8952 0.9022 0.8476 0.7943 0.8201 0.0 0.0 0.0 0.8974 0.8763 0.8866
0.0145 2.7624 9500 0.0188 0.9789 0.9898 0.9922 0.9910 0.8463 0.8146 0.8302 0.9039 0.8987 0.9013 0.8437 0.8171 0.8302 0.0 0.0 0.0 0.8959 0.8807 0.8882
0.0147 2.7915 9600 0.0189 0.9790 0.9901 0.9921 0.9911 0.8422 0.8218 0.8319 0.9068 0.8954 0.9010 0.8511 0.8 0.8247 0.0 0.0 0.0 0.8975 0.8773 0.8872
0.0146 2.8206 9700 0.0188 0.9790 0.9901 0.9921 0.9911 0.8436 0.8197 0.8315 0.9051 0.8973 0.9012 0.8473 0.8086 0.8275 0.0 0.0 0.0 0.8965 0.8794 0.8878
0.0148 2.8497 9800 0.0189 0.9790 0.9899 0.9923 0.9911 0.8459 0.8173 0.8314 0.9057 0.8968 0.9013 0.8427 0.8114 0.8268 0.0 0.0 0.0 0.8961 0.8795 0.8876
0.0149 2.8787 9900 0.0188 0.9791 0.9900 0.9922 0.9911 0.8440 0.8200 0.8318 0.9062 0.8956 0.9008 0.8452 0.8114 0.8280 0.0 0.0 0.0 0.8964 0.8798 0.8880
0.0147 2.9078 10000 0.0188 0.9791 0.9900 0.9923 0.9911 0.8451 0.8187 0.8317 0.9062 0.8959 0.9010 0.8437 0.8171 0.8302 0.0 0.0 0.0 0.8962 0.8810 0.8885
0.0147 2.9369 10100 0.0188 0.9791 0.9900 0.9923 0.9911 0.8450 0.8189 0.8318 0.9062 0.8962 0.9012 0.8412 0.8171 0.8290 0.0 0.0 0.0 0.8956 0.8811 0.8883
0.0147 2.9660 10200 0.0188 0.9791 0.9900 0.9923 0.9911 0.8450 0.8192 0.8319 0.9060 0.8964 0.9011 0.8412 0.8171 0.8290 0.0 0.0 0.0 0.8956 0.8812 0.8883
0.0143 2.9951 10300 0.0188 0.9791 0.9900 0.9922 0.9911 0.8448 0.8192 0.8318 0.9060 0.8964 0.9011 0.8412 0.8171 0.8290 0.0 0.0 0.0 0.8955 0.8812 0.8883

Framework versions

  • Transformers 4.49.0.dev0
  • Pytorch 2.6.0+cu124
  • Datasets 3.3.0
  • Tokenizers 0.21.0
Downloads last month
2
Safetensors
Model size
396M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for whooray/focal_modernbert_punctuation_128_v3

Finetuned
(75)
this model