Training in progress epoch 0
Browse files
README.md
CHANGED
@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
|
|
14 |
|
15 |
This model is a fine-tuned version of [distilbert-base-cased-distilled-squad](https://huggingface.co/distilbert-base-cased-distilled-squad) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
-
- Train Loss:
|
18 |
-
- Validation Loss: 1.
|
19 |
-
- Epoch:
|
20 |
|
21 |
## Model description
|
22 |
|
@@ -35,15 +35,14 @@ More information needed
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
-
- optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps':
|
39 |
- training_precision: float32
|
40 |
|
41 |
### Training results
|
42 |
|
43 |
| Train Loss | Validation Loss | Epoch |
|
44 |
|:----------:|:---------------:|:-----:|
|
45 |
-
|
|
46 |
-
| 0.4442 | 1.2472 | 1 |
|
47 |
|
48 |
|
49 |
### Framework versions
|
|
|
14 |
|
15 |
This model is a fine-tuned version of [distilbert-base-cased-distilled-squad](https://huggingface.co/distilbert-base-cased-distilled-squad) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
+
- Train Loss: 1.7381
|
18 |
+
- Validation Loss: 1.3996
|
19 |
+
- Epoch: 0
|
20 |
|
21 |
## Model description
|
22 |
|
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
+
- optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 46, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
|
39 |
- training_precision: float32
|
40 |
|
41 |
### Training results
|
42 |
|
43 |
| Train Loss | Validation Loss | Epoch |
|
44 |
|:----------:|:---------------:|:-----:|
|
45 |
+
| 1.7381 | 1.3996 | 0 |
|
|
|
46 |
|
47 |
|
48 |
### Framework versions
|
logs/train/events.out.tfevents.1656582647.34ee2ceb6225.73.0.v2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:16b0e2d371d66e3e31d73a2f96ce9d5899e595c93cfc4e8b2e2b61f6f53019c8
|
3 |
+
size 1523663
|
logs/validation/events.out.tfevents.1656582691.34ee2ceb6225.73.1.v2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:17edfc2f784ded060bc392ab65e9ab53eab4b42be658f4da3124dac80dfd5720
|
3 |
+
size 193
|
tf_model.h5
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 260895816
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f5b0e9888df88e035fc2e44a1a3949cbaee3e4466c1d3cc0db20c3d2e0f2a197
|
3 |
size 260895816
|