urarik commited on
Commit
89e79e8
·
verified ·
1 Parent(s): 8230d3d

Model save

Browse files
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
- base_model: google/umt5-small
5
  tags:
6
  - generated_from_trainer
7
  metrics:
@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # t5-asr-CV16
18
 
19
- This model is a fine-tuned version of [google/umt5-small](https://huggingface.co/google/umt5-small) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 20.7094
22
- - Wer: 3.7797
23
 
24
  ## Model description
25
 
@@ -38,7 +38,7 @@ More information needed
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
- - learning_rate: 0.0005
42
  - train_batch_size: 64
43
  - eval_batch_size: 64
44
  - seed: 42
@@ -47,22 +47,23 @@ The following hyperparameters were used during training:
47
  - optimizer: Use paged_lion_8bit and the args are:
48
  No additional optimizer arguments
49
  - lr_scheduler_type: cosine
50
- - lr_scheduler_warmup_ratio: 0.08333333333333333
51
- - num_epochs: 3
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Wer |
56
  |:-------------:|:------:|:----:|:---------------:|:------:|
57
- | 62.4704 | 0.3279 | 10 | 15.1946 | 3.7820 |
58
- | 60.6268 | 0.6557 | 20 | 16.9860 | 3.7589 |
59
- | 58.4451 | 0.9836 | 30 | 18.5103 | 4.0257 |
60
- | 53.5537 | 1.2951 | 40 | 19.5617 | 4.2520 |
61
- | 54.9241 | 1.6230 | 50 | 20.1924 | 4.1488 |
62
- | 53.9461 | 1.9508 | 60 | 20.4946 | 3.9207 |
63
- | 50.5189 | 2.2623 | 70 | 20.6034 | 3.8890 |
64
- | 52.8163 | 2.5902 | 80 | 20.6852 | 3.8316 |
65
- | 52.9224 | 2.9180 | 90 | 20.7094 | 3.7797 |
 
66
 
67
 
68
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
+ base_model: urarik/t5-asr-CV16
5
  tags:
6
  - generated_from_trainer
7
  metrics:
 
16
 
17
  # t5-asr-CV16
18
 
19
+ This model is a fine-tuned version of [urarik/t5-asr-CV16](https://huggingface.co/urarik/t5-asr-CV16) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 17.9832
22
+ - Wer: 3.9749
23
 
24
  ## Model description
25
 
 
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
+ - learning_rate: 0.0001
42
  - train_batch_size: 64
43
  - eval_batch_size: 64
44
  - seed: 42
 
47
  - optimizer: Use paged_lion_8bit and the args are:
48
  No additional optimizer arguments
49
  - lr_scheduler_type: cosine
50
+ - lr_scheduler_warmup_ratio: 0.05
51
+ - num_epochs: 5
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Wer |
56
  |:-------------:|:------:|:----:|:---------------:|:------:|
57
+ | 61.3853 | 0.4918 | 15 | 15.5590 | 3.7388 |
58
+ | 60.7398 | 0.9836 | 30 | 16.0317 | 3.8106 |
59
+ | 57.146 | 1.4590 | 45 | 16.5961 | 3.8483 |
60
+ | 59.4602 | 1.9508 | 60 | 17.0526 | 3.7376 |
61
+ | 56.0474 | 2.4262 | 75 | 17.4439 | 3.9441 |
62
+ | 58.6886 | 2.9180 | 90 | 17.6277 | 3.7956 |
63
+ | 55.4761 | 3.3934 | 105 | 17.8121 | 3.9299 |
64
+ | 58.2042 | 3.8852 | 120 | 17.8014 | 3.9418 |
65
+ | 55.1715 | 4.3607 | 135 | 17.8874 | 4.0185 |
66
+ | 58.111 | 4.8525 | 150 | 17.9832 | 3.9749 |
67
 
68
 
69
  ### Framework versions
runs/Feb10_19-04-32_c0e556c53359/events.out.tfevents.1739214274.c0e556c53359.18.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:79cc823ffe86c9abf73626acbbb013b7b8a72db7c336fec8bacb868dcdd85bc5
3
- size 6143
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4930d66a2f7b89a93af759bb746fe9b7fc1cc7e9af82e0fe06d2349923ccc300
3
+ size 12227