DeepDream2045 commited on
Commit
f039639
·
verified ·
1 Parent(s): fae998a

End of training

Browse files
README.md CHANGED
@@ -95,7 +95,7 @@ xformers_attention: true
95
 
96
  This model is a fine-tuned version of [peft-internal-testing/tiny-dummy-qwen2](https://huggingface.co/peft-internal-testing/tiny-dummy-qwen2) on the None dataset.
97
  It achieves the following results on the evaluation set:
98
- - Loss: 11.9305
99
 
100
  ## Model description
101
 
@@ -133,8 +133,8 @@ The following hyperparameters were used during training:
133
  | Training Loss | Epoch | Step | Validation Loss |
134
  |:-------------:|:------:|:----:|:---------------:|
135
  | 11.9314 | 0.0009 | 1 | 11.9316 |
136
- | 11.9265 | 0.0221 | 25 | 11.9308 |
137
- | 11.9277 | 0.0442 | 50 | 11.9305 |
138
 
139
 
140
  ### Framework versions
 
95
 
96
  This model is a fine-tuned version of [peft-internal-testing/tiny-dummy-qwen2](https://huggingface.co/peft-internal-testing/tiny-dummy-qwen2) on the None dataset.
97
  It achieves the following results on the evaluation set:
98
+ - Loss: 11.9307
99
 
100
  ## Model description
101
 
 
133
  | Training Loss | Epoch | Step | Validation Loss |
134
  |:-------------:|:------:|:----:|:---------------:|
135
  | 11.9314 | 0.0009 | 1 | 11.9316 |
136
+ | 11.9266 | 0.0221 | 25 | 11.9309 |
137
+ | 11.9282 | 0.0442 | 50 | 11.9307 |
138
 
139
 
140
  ### Framework versions
adapter_config.json CHANGED
@@ -20,11 +20,11 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
 
23
  "v_proj",
24
- "down_proj",
25
- "o_proj",
26
  "q_proj",
27
- "k_proj",
 
28
  "up_proj",
29
  "gate_proj"
30
  ],
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
+ "k_proj",
24
  "v_proj",
 
 
25
  "q_proj",
26
+ "o_proj",
27
+ "down_proj",
28
  "up_proj",
29
  "gate_proj"
30
  ],
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:30045b4a1c7bba513667889e6ddadfca3e589657436c2a73e551ef44de75e8f7
3
  size 55170
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c546a929dd0cfb01006cbd5e7618ed5d9a703a68d7463c9cd02df8c87f082914
3
  size 55170
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dc53c144558ee904873659330389a0945b7e7b5e498309a90bcd1648852f9e6d
3
  size 48552
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e5f3f6dfe09555a9918947eae02e1bee9477e06d5c745fa0e77cbbfe9c669525
3
  size 48552
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34dd7bd0811ed6d68d82cf0b98d7b29f79f74683cfc455f458407a01e3eb2776
3
  size 6776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:108c5f3f22adcc9d16d2953e63cd5d58fc20f404970f68f6bcd270a66e117481
3
  size 6776