wkCircle committed on
Commit d7e2a0c · verified · 1 Parent(s): d8337bd

End of training

Files changed (2)
  1. README.md +87 -61
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,61 +1,87 @@
- ---
- library_name: transformers
- license: apache-2.0
- base_model: ntu-spml/distilhubert
- tags:
- - generated_from_trainer
- datasets:
- - marsyas/gtzan
- model-index:
- - name: distilhubert-finetuned-gtzan
-   results: []
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # distilhubert-finetuned-gtzan
-
- This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the GTZAN dataset.
- It achieves the following results on the evaluation set:
- - eval_loss: 1.8559
- - eval_accuracy: 0.57
- - eval_runtime: 119.8761
- - eval_samples_per_second: 0.834
- - eval_steps_per_second: 0.108
- - epoch: 1.0
- - step: 113
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 5e-05
- - train_batch_size: 8
- - eval_batch_size: 8
- - seed: 42
- - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 10
- - mixed_precision_training: Native AMP
-
- ### Framework versions
-
- - Transformers 4.48.2
- - Pytorch 2.6.0+cu126
- - Datasets 3.2.0
- - Tokenizers 0.21.0
+ ---
+ library_name: transformers
+ license: apache-2.0
+ base_model: ntu-spml/distilhubert
+ tags:
+ - generated_from_trainer
+ datasets:
+ - marsyas/gtzan
+ metrics:
+ - accuracy
+ model-index:
+ - name: distilhubert-finetuned-gtzan
+   results:
+   - task:
+       name: Audio Classification
+       type: audio-classification
+     dataset:
+       name: GTZAN
+       type: marsyas/gtzan
+       config: all
+       split: train
+       args: all
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.82
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # distilhubert-finetuned-gtzan
+
+ This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the GTZAN dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.7001
+ - Accuracy: 0.82
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 8
+ - eval_batch_size: 8
+ - seed: 42
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_ratio: 0.1
+ - num_epochs: 10
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | 1.5997 | 1.0 | 113 | 1.5628 | 0.56 |
+ | 1.0615 | 2.0 | 226 | 1.1134 | 0.68 |
+ | 0.9443 | 3.0 | 339 | 0.8761 | 0.78 |
+ | 0.491 | 4.0 | 452 | 0.8227 | 0.78 |
+ | 0.382 | 5.0 | 565 | 0.7287 | 0.79 |
+ | 0.2423 | 6.0 | 678 | 0.6206 | 0.82 |
+ | 0.1179 | 7.0 | 791 | 0.6201 | 0.82 |
+ | 0.055 | 8.0 | 904 | 0.6419 | 0.83 |
+ | 0.0533 | 9.0 | 1017 | 0.6680 | 0.84 |
+ | 0.0329 | 10.0 | 1130 | 0.7001 | 0.82 |
+
+
+ ### Framework versions
+
+ - Transformers 4.47.1
+ - Pytorch 2.5.1+cu124
+ - Datasets 3.2.0
+ - Tokenizers 0.21.0
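
Note on the scheduler settings in the updated README.md: `lr_scheduler_type: linear` with `lr_scheduler_warmup_ratio: 0.1`, `learning_rate: 5e-05`, and the 1130 total steps shown in the results table imply roughly 113 warmup steps, which is about one epoch. The following is a minimal plain-Python sketch of such a schedule, assuming linear warmup from zero followed by linear decay to zero. The function name `linear_lr` and the rounding of the warmup step count are illustrative assumptions, not the Trainer's own code.

```python
def linear_lr(step, base_lr=5e-5, total_steps=1130, warmup_ratio=0.1):
    """Learning rate at `step` for linear warmup then linear decay (sketch)."""
    # Assumption: warmup length is the rounded fraction of the total steps.
    warmup_steps = round(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr over the warmup steps.
        return base_lr * step / warmup_steps
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Start of training, end of warmup, mid-run, and final step.
    for step in (0, 113, 565, 1130):
        print(step, linear_lr(step))
```

Under these assumptions the learning rate peaks at the end of epoch 1 and reaches zero at step 1130, the last row of the results table.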
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:cb76a70e68af28ad862e995b867a1c1fe7603b3124e731433caf941bda737a15
+ oid sha256:3a9b0b8e69ecc4689152c7336a9beb4c92b2857243e5d84d4347ff791bebdf66
  size 94771728
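
Note on the reported metrics: the headline values in the updated README.md (Loss 0.7001, Accuracy 0.82) match the epoch-10 row of the results table, not its best row. The diff does not say whether the committed weights come from the last checkpoint or from the best one. The sketch below uses the table rows copied verbatim to find the epochs with the lowest validation loss and the highest accuracy. The names `ROWS` and `best_epochs` are hypothetical helpers for illustration only.

```python
# Rows copied from the "Training results" table in README.md:
# (epoch, training_loss, validation_loss, accuracy)
ROWS = [
    (1, 1.5997, 1.5628, 0.56),
    (2, 1.0615, 1.1134, 0.68),
    (3, 0.9443, 0.8761, 0.78),
    (4, 0.491, 0.8227, 0.78),
    (5, 0.382, 0.7287, 0.79),
    (6, 0.2423, 0.6206, 0.82),
    (7, 0.1179, 0.6201, 0.82),
    (8, 0.055, 0.6419, 0.83),
    (9, 0.0533, 0.6680, 0.84),
    (10, 0.0329, 0.7001, 0.82),
]


def best_epochs(rows):
    """Return (epoch with lowest validation loss, epoch with highest accuracy)."""
    by_loss = min(rows, key=lambda r: r[2])
    by_acc = max(rows, key=lambda r: r[3])
    return by_loss[0], by_acc[0]


if __name__ == "__main__":
    loss_epoch, acc_epoch = best_epochs(ROWS)
    print("lowest validation loss at epoch", loss_epoch)
    print("highest accuracy at epoch", acc_epoch)
    print("final epoch row:", ROWS[-1])
```

Validation loss bottoms out at epoch 7 and rises afterwards while training loss keeps falling, which may indicate overfitting in the later epochs. With only the evaluation figures in this table, that reading is tentative.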