End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [kiranpantha/whisper-large-v3-nepali](https://huggingface.co/kiranpantha/whisper-large-v3-nepali) on the OpenSLR54 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1986
 ## Model description
@@ -53,9 +53,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 7    | 0.5523          |
-| No log        | 2.0   | 14   | 0.2448          |
-| No log        | 3.0   | 21   | 0.1986          |
 ### Framework versions

 This model is a fine-tuned version of [kiranpantha/whisper-large-v3-nepali](https://huggingface.co/kiranpantha/whisper-large-v3-nepali) on the OpenSLR54 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1991
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 7    | 0.5524          |
+| No log        | 2.0   | 14   | 0.2450          |
+| No log        | 3.0   | 21   | 0.1991          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -26,9 +26,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
     "c_attn",
-    "k_proj"
   ],
   "task_type": null,
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "k_proj",
     "c_attn",
+    "v_proj"
   ],
   "task_type": null,
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:641706cde3900c7bf2e85edc461dd7ccfa55304890ad9f0499f24fc4e09a9973
 size 62969640

 version https://git-lfs.github.com/spec/v1
+oid sha256:4fbb82086265a3fd111da216b12775298ca4a8c6e5adc8be78af6710a4b604a9
 size 62969640

runs/Feb10_16-34-25_idc-training-gpu-compute-03/events.out.tfevents.1739205266.idc-training-gpu-compute-03.350171.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e8d4510bf46f1dd277ceb9b60b02f9d383eb2c21762a0314f0e4ff68a1fc6e78
+size 7271

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:36d3ebbff9ae7d7ee1dd66bdc7107c4f008d7894d4c3cf0428e36ae36d7ce233
 size 5624

 version https://git-lfs.github.com/spec/v1
+oid sha256:c176ea4b7d2bb0aa5842b708890cf575f8b516ca5bbd27f645c7f481f6647368
 size 5624