remisultan commited on
Commit
36204db
Β·
1 Parent(s): 7017c9e

fix: add quantization models and performance

Browse files
Files changed (3) hide show
  1. README.md +44 -0
  2. model.onnx +3 -0
  3. model.quant.onnx +3 -0
README.md CHANGED
@@ -15,6 +15,7 @@ language:
15
  - ja
16
  - tt
17
  license: openrail++
 
18
  base_model:
19
  - distilbert/distilbert-base-multilingual-cased
20
  datasets:
@@ -34,6 +35,8 @@ We perform an 85/15 train-test split per language based on the `textdetox` datas
34
 
35
  ## Performance Overview
36
 
 
 
37
  | Language | Validation F1 | Training F1 | Ξ”F1 |
38
  |------------|---------------|-------------|---------|
39
  | Russian | 0.9572 | 0.9897 | -0.0324 |
@@ -52,6 +55,47 @@ We perform an 85/15 train-test split per language based on the `textdetox` datas
52
  | Hinglish | 0.7234 | 0.8533 | -0.1299 |
53
  | Hebrew | 0.6455 | 0.8441 | -0.1987 |
54
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55
  ## License
56
 
57
  This model is licensed under [OpenRAIL++](LICENSE)
 
15
  - ja
16
  - tt
17
  license: openrail++
18
+ library: transformers
19
  base_model:
20
  - distilbert/distilbert-base-multilingual-cased
21
  datasets:
 
35
 
36
  ## Performance Overview
37
 
38
+ ### Original model
39
+
40
  | Language | Validation F1 | Training F1 | Ξ”F1 |
41
  |------------|---------------|-------------|---------|
42
  | Russian | 0.9572 | 0.9897 | -0.0324 |
 
55
  | Hinglish | 0.7234 | 0.8533 | -0.1299 |
56
  | Hebrew | 0.6455 | 0.8441 | -0.1987 |
57
 
58
+ ### Quantized model (ONNX)
59
+
60
+ | Language | Original Val F1 | Quantized Val F1 | Ξ” Val F1 |
61
+ | --------- | --------------- | ---------------- | ----------- |
62
+ | Russian | 0.9572 | 0.9609 | **+0.0037** |
63
+ | English | 0.9528 | 0.9495 | **βˆ’0.0033** |
64
+ | German | 0.8904 | 0.8842 | **βˆ’0.0062** |
65
+ | Hindi | 0.9248 | 0.8940 | **βˆ’0.0300** |
66
+ | French | 0.9446 | 0.9351 | **βˆ’0.0095** |
67
+ | Japanese | 0.8658 | 0.8584 | **βˆ’0.0074** |
68
+ | Ukrainian | 0.8997 | 0.8988 | **βˆ’0.0009** |
69
+ | Tatar | 0.9200 | 0.9148 | **βˆ’0.0052** |
70
+ | Amharic | 0.6513 | 0.6377 | **βˆ’0.0136** |
71
+ | Spanish | 0.8564 | 0.8439 | **βˆ’0.0125** |
72
+ | Chinese | 0.6865 | 0.6697 | **βˆ’0.0168** |
73
+ | Arabic | 0.7563 | 0.7535 | **βˆ’0.0028** |
74
+ | Italian | 0.8223 | 0.8033 | **βˆ’0.0190** |
75
+ | Hinglish | 0.7234 | 0.7260 | **+0.0026** |
76
+ | Hebrew | 0.6455 | 0.6190 | **βˆ’0.0265** |
77
+
78
+ | Language | Original Train F1 | Quantized Train F1 | Ξ” Train F1 |
79
+ | --------- | ----------------- | ------------------ | ----------- |
80
+ | Russian | 0.9897 | 0.9875 | **βˆ’0.0022** |
81
+ | English | 0.9853 | 0.9857 | **+0.0004** |
82
+ | German | 0.9547 | 0.9369 | **βˆ’0.0178** |
83
+ | Hindi | 0.9599 | 0.9335 | **βˆ’0.0264** |
84
+ | French | 0.9874 | 0.9814 | **βˆ’0.0060** |
85
+ | Japanese | 0.9253 | 0.9081 | **βˆ’0.0172** |
86
+ | Ukrainian | 0.9511 | 0.9476 | **βˆ’0.0035** |
87
+ | Tatar | 0.9682 | 0.9631 | **βˆ’0.0051** |
88
+ | Amharic | 0.6915 | 0.6863 | **βˆ’0.0052** |
89
+ | Spanish | 0.9399 | 0.9273 | **βˆ’0.0126** |
90
+ | Chinese | 0.7807 | 0.7596 | **βˆ’0.0211** |
91
+ | Arabic | 0.8550 | 0.8481 | **βˆ’0.0069** |
92
+ | Italian | 0.9271 | 0.9193 | **βˆ’0.0078** |
93
+ | Hinglish | 0.8533 | 0.8436 | **βˆ’0.0097** |
94
+ | Hebrew | 0.8441 | 0.8204 | **βˆ’0.0237** |
95
+
96
+ The quantized model maintains high accuracy and generalization on the evaluation set,
97
+ with negligible performance loss for most languages.
98
+
99
  ## License
100
 
101
  This model is licensed under [OpenRAIL++](LICENSE)
model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bee01a9d2d474a589c5db631d99c382d5502bb81a0fe318d247f46c16b25f060
3
+ size 541416974
model.quant.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cea601d7d45fa1e8fbc8c716127c9efff710339a00bea422d6f2565537b4fd80
3
+ size 135898271