youralien/roberta-Reflections-goodareas-eval_FeedbackESConv5pp_CARE10pp-sweeps-current

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [FacebookAI/roberta-large](https://huggingface.co/FacebookAI/roberta-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1875
-- Accuracy: 0.8922
-- Precision: 0.5294
-- Recall: 0.3103
-- F1: 0.3913
 ## Model description
@@ -44,29 +44,38 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2.056366063783178e-06
 - train_batch_size: 32
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.4035        | 1.0   | 134  | 0.2330          | 0.8883   | 0.0       | 0.0    | 0.0    |
-| 0.306         | 2.0   | 268  | 0.2158          | 0.8883   | 0.0       | 0.0    | 0.0    |
-| 0.2689        | 3.0   | 402  | 0.1797          | 0.8909   | 1.0       | 0.0230 | 0.0449 |
-| 0.2542        | 4.0   | 536  | 0.1749          | 0.8896   | 1.0       | 0.0115 | 0.0227 |
-| 0.2439        | 5.0   | 670  | 0.1724          | 0.8960   | 0.875     | 0.0805 | 0.1474 |
-| 0.2293        | 6.0   | 804  | 0.1815          | 0.8973   | 0.5745    | 0.3103 | 0.4030 |
-| 0.226         | 7.0   | 938  | 0.1883          | 0.8935   | 0.5417    | 0.2989 | 0.3852 |
-| 0.2272        | 8.0   | 1072 | 0.1847          | 0.8960   | 0.5714    | 0.2759 | 0.3721 |
-| 0.2193        | 9.0   | 1206 | 0.1907          | 0.8935   | 0.5370    | 0.3333 | 0.4113 |
-| 0.2186        | 10.0  | 1340 | 0.1875          | 0.8922   | 0.5294    | 0.3103 | 0.3913 |
 ### Framework versions

 This model is a fine-tuned version of [FacebookAI/roberta-large](https://huggingface.co/FacebookAI/roberta-large) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3690
+- Accuracy: 0.8716
+- Precision: 0.4404
+- Recall: 0.5517
+- F1: 0.4898
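The reported F1 is the harmonic mean of precision and recall, so the figures in both revisions can be cross-checked directly. A minimal sketch using the values from this diff; the helper name `f1_score` is illustrative and not part of the training code:

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; defined as 0.0 when both are 0."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the two model card revisions in this diff.
old_f1 = f1_score(0.5294, 0.3103)  # previous revision
new_f1 = f1_score(0.4404, 0.5517)  # new revision

print(f"old F1 = {old_f1:.4f}, new F1 = {new_f1:.4f}")
```

Both results should agree with the reported F1 values (0.3913 and 0.4898) up to rounding of the inputs. The new revision trades precision for recall and ends with a higher F1.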
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2.249783020581008e-05
 - train_batch_size: 32
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 20
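For orientation, a linear schedule decays the learning rate from its peak to zero over the run. A rough sketch under two assumptions: no warmup (the new revision no longer lists `lr_scheduler_warmup_ratio`, but the effective value is not shown in this diff) and 1040 total optimizer steps, taken from the last row of the new results table. `linear_lr` is a hypothetical helper, not the Trainer's own code:

```python
def linear_lr(step: int, peak_lr: float, total_steps: int, warmup_steps: int = 0) -> float:
    """Linear warmup to peak_lr, then linear decay to zero at total_steps."""
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


PEAK_LR = 2.249783020581008e-05  # learning_rate from the new revision
TOTAL_STEPS = 1040               # assumption: final step in the new results table

# Learning rate at the start, midpoint, and end of the run (warmup assumed to be 0).
for step in (0, 520, 1040):
    print(step, linear_lr(step, PEAK_LR, TOTAL_STEPS))
```

Under these assumptions the rate at step 520 is half the peak, and it reaches zero at step 1040.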
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.3432        | 1.0   | 52   | 0.0883          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.282         | 2.0   | 104  | 0.1779          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.2616        | 3.0   | 156  | 0.1050          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.2628        | 4.0   | 208  | 0.1788          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.2505        | 5.0   | 260  | 0.1225          | 0.8947   | 0.6471    | 0.1264 | 0.2115 |
+| 0.2287        | 6.0   | 312  | 0.1009          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.2023        | 7.0   | 364  | 0.1473          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.1884        | 8.0   | 416  | 0.1152          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.1822        | 9.0   | 468  | 0.1090          | 0.8883   | 0.0       | 0.0    | 0.0    |
+| 0.1596        | 10.0  | 520  | 0.1550          | 0.8652   | 0.4297    | 0.6322 | 0.5116 |
+| 0.1388        | 11.0  | 572  | 0.1388          | 0.8768   | 0.4536    | 0.5057 | 0.4783 |
+| 0.1378        | 12.0  | 624  | 0.1370          | 0.8793   | 0.4624    | 0.4943 | 0.4778 |
+| 0.1429        | 13.0  | 676  | 0.1050          | 0.9024   | 0.5902    | 0.4138 | 0.4865 |
+| 0.1124        | 14.0  | 728  | 0.1775          | 0.8485   | 0.3869    | 0.6092 | 0.4732 |
+| 0.1133        | 15.0  | 780  | 0.1464          | 0.8793   | 0.4632    | 0.5057 | 0.4835 |
+| 0.0989        | 16.0  | 832  | 0.2222          | 0.8575   | 0.4062    | 0.5977 | 0.4837 |
+| 0.108         | 17.0  | 884  | 0.2669          | 0.8729   | 0.4464    | 0.5747 | 0.5025 |
+| 0.1096        | 18.0  | 936  | 0.2570          | 0.8768   | 0.4563    | 0.5402 | 0.4947 |
+| 0.085         | 19.0  | 988  | 0.2943          | 0.8755   | 0.4528    | 0.5517 | 0.4974 |
+| 0.1164        | 20.0  | 1040 | 0.3690          | 0.8716   | 0.4404    | 0.5517 | 0.4898 |
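The final-epoch metrics reported in the card are not the best F1 in this table. A small sketch that picks the row with the highest F1 and the row with the lowest validation loss from the new run. The rows are transcribed from the table above; whether the run kept its best checkpoint is not stated in this diff:

```python
# (epoch, validation_loss, f1) transcribed from the new results table.
rows = [
    (1, 0.0883, 0.0), (2, 0.1779, 0.0), (3, 0.1050, 0.0), (4, 0.1788, 0.0),
    (5, 0.1225, 0.2115), (6, 0.1009, 0.0), (7, 0.1473, 0.0), (8, 0.1152, 0.0),
    (9, 0.1090, 0.0), (10, 0.1550, 0.5116), (11, 0.1388, 0.4783),
    (12, 0.1370, 0.4778), (13, 0.1050, 0.4865), (14, 0.1775, 0.4732),
    (15, 0.1464, 0.4835), (16, 0.2222, 0.4837), (17, 0.2669, 0.5025),
    (18, 0.2570, 0.4947), (19, 0.2943, 0.4974), (20, 0.3690, 0.4898),
]

best_by_f1 = max(rows, key=lambda r: r[2])    # highest F1
best_by_loss = min(rows, key=lambda r: r[1])  # lowest validation loss

print("best F1:", best_by_f1)
print("lowest validation loss:", best_by_loss)
```

The two criteria disagree here. The highest F1 is at epoch 10, while the lowest validation loss is at epoch 1, where precision, recall, and F1 are all 0.0. Selecting a checkpoint by validation loss alone would therefore pick a model that predicts no positives.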
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1b84c43ccad247a462d642e17115aa8af25eefbca964296182c5d1377c6af3e2
 size 1421495416

 version https://git-lfs.github.com/spec/v1
+oid sha256:ef97e02eea81dab200967fe6af05fad275e44d99cba604756d906d78e6b7092a
 size 1421495416