Model save

Browse files

Files changed (5) hide show

README.md +79 -0
intent_report_test.txt +75 -0
model.safetensors +1 -1
model_predict_test.csv +0 -0
slot_report_test.txt +59 -0

README.md ADDED Viewed

	@@ -0,0 +1,79 @@

+---
+library_name: transformers
+license: mit
+base_model: FacebookAI/xlm-roberta-base
+tags:
+- generated_from_trainer
+model-index:
+- name: xlm-roberta-base_massive_crf_v1
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# xlm-roberta-base_massive_crf_v1
+This model is a fine-tuned version of [FacebookAI/xlm-roberta-base](https://huggingface.co/FacebookAI/xlm-roberta-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.4117
+- Slot P: 0.6934
+- Slot R: 0.7706
+- Slot F1: 0.7300
+- Slot Exact Match: 0.6995
+- Intent Acc: 0.8495
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 128
+- eval_batch_size: 128
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 256
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_ratio: 0.06
+- num_epochs: 30
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Slot P | Slot R | Slot F1 | Slot Exact Match | Intent Acc |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:-------:|:----------------:|:----------:|
+| No log        | 1.0   | 45   | 22.8757         | 0.0    | 0.0    | 0.0     | 0.3187           | 0.0300     |
+| 95.1993       | 2.0   | 90   | 15.1787         | 0.3194 | 0.2164 | 0.2580  | 0.3015           | 0.1117     |
+| 36.1644       | 3.0   | 135  | 10.7793         | 0.4180 | 0.4502 | 0.4335  | 0.4506           | 0.1864     |
+| 24.5568       | 4.0   | 180  | 7.5359          | 0.5813 | 0.6333 | 0.6062  | 0.5706           | 0.3586     |
+| 16.5092       | 5.0   | 225  | 5.7306          | 0.6266 | 0.7020 | 0.6621  | 0.6203           | 0.5957     |
+| 11.609        | 6.0   | 270  | 4.9020          | 0.6610 | 0.7363 | 0.6966  | 0.6626           | 0.7280     |
+| 8.4757        | 7.0   | 315  | 4.4249          | 0.6701 | 0.7448 | 0.7055  | 0.6744           | 0.7762     |
+| 6.8454        | 8.0   | 360  | 4.3691          | 0.6841 | 0.7532 | 0.7170  | 0.6960           | 0.7973     |
+| 5.6898        | 9.0   | 405  | 4.4460          | 0.6747 | 0.7647 | 0.7169  | 0.6886           | 0.8141     |
+| 4.6831        | 10.0  | 450  | 4.2133          | 0.7067 | 0.7552 | 0.7302  | 0.7073           | 0.8342     |
+| 4.6831        | 11.0  | 495  | 4.4300          | 0.6954 | 0.7542 | 0.7236  | 0.6995           | 0.8347     |
+| 3.9992        | 12.0  | 540  | 4.3942          | 0.6977 | 0.7637 | 0.7292  | 0.7024           | 0.8416     |
+| 3.5154        | 13.0  | 585  | 4.4117          | 0.6934 | 0.7706 | 0.7300  | 0.6995           | 0.8495     |
+### Framework versions
+- Transformers 4.55.0
+- Pytorch 2.7.0+cu126
+- Datasets 3.6.0
+- Tokenizers 0.21.4

intent_report_test.txt ADDED Viewed

	@@ -0,0 +1,75 @@

+              precision    recall  f1-score   support
+           0       0.86      0.94      0.90        88
+           1       0.76      0.94      0.84        36
+           2       0.92      0.97      0.94        35
+           3       0.81      0.83      0.82        35
+           4       0.92      0.88      0.90        26
+           5       0.00      0.00      0.00         1
+           6       0.92      0.79      0.85        43
+           7       0.00      0.00      0.00         4
+           8       1.00      0.83      0.91        18
+           9       0.87      0.85      0.86        72
+          10       0.95      1.00      0.97        39
+          11       0.68      1.00      0.81        15
+          12       0.57      0.54      0.56       169
+          13       0.93      0.96      0.94       156
+          14       0.56      0.69      0.62        13
+          15       0.67      0.67      0.67        12
+          16       0.89      0.77      0.83        22
+          17       0.75      0.81      0.78        26
+          18       0.92      0.81      0.86        27
+          19       0.73      0.87      0.79        31
+          20       0.89      0.80      0.85        41
+          21       0.83      0.87      0.85        39
+          22       0.89      0.86      0.88       124
+          23       0.91      0.85      0.88        34
+          24       1.00      0.40      0.57        10
+          25       0.95      0.95      0.95        19
+          26       0.87      0.84      0.86        57
+          27       0.79      0.76      0.78        25
+          28       0.00      0.00      0.00         6
+          29       0.00      0.00      0.00         6
+          30       0.90      0.99      0.94        67
+          31       0.72      0.62      0.67        21
+          32       0.74      0.83      0.79       126
+          33       0.95      0.92      0.93       114
+          34       0.74      0.88      0.81        26
+          35       0.88      0.64      0.74        11
+          36       0.75      0.81      0.78        72
+          37       0.00      0.00      0.00         0
+          38       1.00      0.20      0.33        15
+          39       0.91      0.80      0.85        25
+          40       0.93      0.93      0.93        43
+          41       0.00      0.00      0.00         3
+          42       0.87      0.78      0.82        51
+          43       0.65      0.36      0.46        36
+          44       0.96      0.92      0.94       119
+          45       0.81      0.91      0.86       176
+          46       0.74      0.91      0.82        32
+          47       0.97      0.88      0.92        81
+          48       0.88      0.93      0.90        41
+          49       0.74      0.83      0.78       141
+          50       0.88      0.90      0.89       209
+          51       0.92      0.94      0.93        35
+          52       0.95      0.90      0.93        21
+          53       0.98      0.90      0.94        52
+          54       0.92      0.96      0.94        23
+          55       0.76      0.80      0.78        20
+          56       0.94      0.86      0.90        36
+          57       0.62      0.83      0.71        35
+          58       0.92      0.70      0.79        63
+          59       0.85      0.80      0.83        51
+    accuracy                           0.84      2974
+   macro avg       0.76      0.74      0.74      2974
+weighted avg       0.84      0.84      0.83      2974
+Confusion matrix:
+[[83  0  0 ...  0  0  0]
+ [ 0 34  0 ...  0  0  0]
+ [ 0  0 34 ...  0  0  0]
+ ...
+ [ 0  0  0 ... 29  0  0]
+ [ 0  0  0 ...  0 44  0]
+ [ 0  0  0 ...  0  0 41]]

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:76689ec22ded558d752254d1e294d2a6d9a2bd03bfc835567aaa86f5d882c98b
 size 1112775472

 version https://git-lfs.github.com/spec/v1
+oid sha256:402d2de6d7d404ac8d90f55b33f0637121a62e29e6b21fee847b4b608623def1
 size 1112775472

model_predict_test.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

slot_report_test.txt ADDED Viewed

	@@ -0,0 +1,59 @@

+                      precision    recall  f1-score   support
+          alarm_type       0.00      0.00      0.00         2
+            app_name       0.08      0.20      0.11         5
+         artist_name       0.69      0.85      0.76        61
+    audiobook_author       0.00      0.00      0.00         5
+      audiobook_name       0.71      0.74      0.72        23
+       business_name       0.75      0.77      0.76        92
+       business_type       0.50      0.58      0.54        31
+       change_amount       0.38      0.33      0.35         9
+         coffee_type       0.33      0.25      0.29         4
+          color_type       0.60      0.69      0.64        26
+        cooking_type       0.00      0.00      0.00         8
+       currency_name       0.81      0.96      0.88        50
+                date       0.81      0.89      0.85       415
+     definition_word       0.77      0.80      0.79        51
+         device_type       0.80      0.70      0.75        57
+          drink_type       0.00      0.00      0.00         1
+       email_address       0.89      0.89      0.89         9
+        email_folder       0.57      0.80      0.67         5
+          event_name       0.67      0.71      0.69       260
+           food_type       0.55      0.74      0.63        72
+           game_name       0.86      0.92      0.89        26
+   general_frequency       0.68      0.75      0.71        20
+         house_place       0.83      0.90      0.86        58
+          ingredient       0.00      0.00      0.00         6
+           joke_type       0.45      0.45      0.45        11
+           list_name       0.73      0.67      0.70        61
+           meal_type       0.61      0.94      0.74        18
+          media_type       0.83      0.80      0.82       128
+          movie_name       0.00      0.00      0.00         2
+          movie_type       0.00      0.00      0.00         3
+         music_album       0.00      0.00      0.00         1
+    music_descriptor       0.00      0.00      0.00         7
+         music_genre       0.69      0.84      0.76        50
+          news_topic       0.52      0.58      0.55        52
+          order_type       0.61      0.85      0.71        20
+              person       0.75      0.83      0.79       216
+       personal_info       0.71      0.71      0.71        14
+          place_name       0.78      0.79      0.78       281
+      player_setting       0.58      0.45      0.51        40
+       playlist_name       0.00      0.00      0.00        15
+  podcast_descriptor       0.43      0.42      0.43        24
+        podcast_name       0.75      0.71      0.73        17
+          radio_name       0.49      0.55      0.51        33
+            relation       0.72      0.75      0.73        59
+           song_name       0.47      0.64      0.54        39
+                time       0.70      0.70      0.70       191
+           time_zone       0.58      0.54      0.56        13
+           timeofday       0.70      0.70      0.70        60
+    transport_agency       0.88      0.78      0.82         9
+transport_descriptor       0.00      0.00      0.00         2
+      transport_name       0.00      0.00      0.00         4
+      transport_type       0.76      0.83      0.79        65
+  weather_descriptor       0.61      0.68      0.64        82
+           micro avg       0.71      0.75      0.73      2813
+           macro avg       0.50      0.54      0.52      2813
+        weighted avg       0.70      0.75      0.72      2813