End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -96,7 +96,7 @@ xformers_attention: true
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.7504
 ## Model description
@@ -134,8 +134,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 2.884         | 0.0041 | 1    | 3.2208          |
-| 2.637         | 0.1024 | 25   | 2.8161          |
-| 2.5138        | 0.2049 | 50   | 2.7504          |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7513
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 2.884         | 0.0041 | 1    | 3.2208          |
+| 2.636         | 0.1024 | 25   | 2.8128          |
+| 2.5226        | 0.2049 | 50   | 2.7513          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "o_proj",
-    "k_proj",
-    "down_proj",
     "gate_proj",
-    "up_proj",
     "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
     "gate_proj",
+    "k_proj",
     "v_proj",
+    "o_proj",
+    "up_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:677c384a283b2d53517506df98c7e88d9063839384437d1da47bed446801d7f4
 size 147859242

 version https://git-lfs.github.com/spec/v1
+oid sha256:99100fc8cb9629f23ee002d2b3bbc8733215ad01cc1381efb5be3c9183578b6a
 size 147859242

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:23310b84d6a0c378e814a0af56e6001d95fb6d450458fba02ee243ba4bb54619
 size 147770496

 version https://git-lfs.github.com/spec/v1
+oid sha256:8277984d9c60132214b91aa106e3c06d9041f5b092d5cdc6114f694d6f2352d9
 size 147770496

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9f83fc5afa9c2c0b02c33e188cde50d7502ddda90bc13a39239ce88b24a8e36d
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c6def4580f714d6ad7e4a090722504acfd79476939235ffabc027470341e135
 size 6776