llama3.1_8b_bwgenerator

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1032
 ## Model description
@@ -51,14 +51,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.7124        | 0.1216 | 40   | 0.2698          |
-| 0.2279        | 0.2433 | 80   | 0.1875          |
-| 0.1546        | 0.3649 | 120  | 0.1293          |
-| 0.1242        | 0.4865 | 160  | 0.1168          |
-| 0.1144        | 0.6081 | 200  | 0.1103          |
-| 0.1097        | 0.7298 | 240  | 0.1064          |
-| 0.107         | 0.8514 | 280  | 0.1043          |
-| 0.105         | 0.9730 | 320  | 0.1032          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0982
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.7155        | 0.1216 | 40   | 0.2546          |
+| 0.218         | 0.2433 | 80   | 0.1804          |
+| 0.1513        | 0.3649 | 120  | 0.1246          |
+| 0.1193        | 0.4865 | 160  | 0.1116          |
+| 0.1092        | 0.6081 | 200  | 0.1051          |
+| 0.1046        | 0.7298 | 240  | 0.1012          |
+| 0.1017        | 0.8514 | 280  | 0.0993          |
+| 0.0999        | 0.9730 | 320  | 0.0982          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -22,8 +22,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:93de4bf57aab1828667bacd328921a78ab4e52d693f424d7d0ba794e8a949deb
 size 6832728

 version https://git-lfs.github.com/spec/v1
+oid sha256:1de99ba3c69896469e24e31d640496d977ca9154e6e8ede2c9d8c58ee1c49a20
 size 6832728

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:94b7f1252de2ef7de0ca1f0f926c6ab823e3b4e41e07becfca600b0eb3228a3e
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:d289afc35be50448ca012c8270b76abc4f7753d1d7bd83a50c3267c0533c498d
 size 5496