End of training

Browse files

Files changed (4) hide show

README.md +13 -16
model.safetensors +1 -1
runs/Jan10_22-06-41_5e4eb05f69bb/events.out.tfevents.1736546813.5e4eb05f69bb.2578.1 +2 -2
tokenizer.json +14 -2

README.md CHANGED Viewed

@@ -4,6 +4,8 @@ license: apache-2.0
 base_model: answerdotai/ModernBERT-base
 tags:
 - generated_from_trainer
 model-index:
 - name: ModernBERT-hf-posts-classifier
   results: []
@@ -16,10 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2372
-- Micro F1: 0.4251
-- Macro F1: 0.0459
-- Weighted F1: 0.2837
 ## Model description
@@ -39,30 +39,27 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 32
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 5
-- mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Micro F1 | Macro F1 | Weighted F1 |
-|:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|
-| No log        | 1.0    | 15   | 0.2547          | 0.4621   | 0.0473   | 0.2926      |
-| No log        | 2.0    | 30   | 0.2524          | 0.3676   | 0.0386   | 0.2377      |
-| No log        | 3.0    | 45   | 0.2412          | 0.4291   | 0.0491   | 0.2895      |
-| No log        | 4.0    | 60   | 0.2378          | 0.4291   | 0.0460   | 0.2846      |
-| No log        | 4.7018 | 70   | 0.2372          | 0.4251   | 0.0459   | 0.2837      |
 ### Framework versions
-- Transformers 4.48.0.dev0
 - Pytorch 2.5.0+cu124
 - Datasets 3.1.0
 - Tokenizers 0.21.0

 base_model: answerdotai/ModernBERT-base
 tags:
 - generated_from_trainer
+metrics:
+- f1
 model-index:
 - name: ModernBERT-hf-posts-classifier
   results: []
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3951
+- F1: 0.6703
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| No log        | 1.0   | 26   | 1.2084          | 0.6381 |
+| No log        | 2.0   | 52   | 1.7850          | 0.5018 |
+| No log        | 3.0   | 78   | 1.1985          | 0.7118 |
+| 0.4128        | 4.0   | 104  | 1.3353          | 0.6716 |
+| 0.4128        | 5.0   | 130  | 1.3951          | 0.6703 |
 ### Framework versions
+- Transformers 4.48.0
 - Pytorch 2.5.0+cu124
 - Datasets 3.1.0
 - Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:129d1974934c34fdd86afe34846d4fb97fdafb4b3ac1dabb7fd08297d69a0c0b
 size 598476704

 version https://git-lfs.github.com/spec/v1
+oid sha256:54c90872842f81bba5861b59b0680e651914d891ee5ecef6a67d367d93fba3f6
 size 598476704

runs/Jan10_22-06-41_5e4eb05f69bb/events.out.tfevents.1736546813.5e4eb05f69bb.2578.1 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5cdad69d94039983046c7ba7d782cba75e585498edef9e39378b6b2aee3c79d4
-size 8339

 version https://git-lfs.github.com/spec/v1
+oid sha256:7615d2a2daf57630e6c598b5270948f3128368025ad0a06c53fb1660f321e65a
+size 8693

tokenizer.json CHANGED Viewed

@@ -1,7 +1,19 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 512,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50283,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,