Model save

Files changed (9) hide show

README.md CHANGED Viewed

@@ -1,16 +1,8 @@
 ---
-license: mit
-base_model: HuggingFaceH4/mistral-7b-sft-beta
 tags:
-- alignment-handbook
 - trl
 - dpo
 - generated_from_trainer
-- trl
-- dpo
-- generated_from_trainer
-datasets:
-- RedMist137/Temp_AIHF
 model-index:
 - name: DPO-Zephyr-7B
   results: []
@@ -21,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 # DPO-Zephyr-7B
-This model is a fine-tuned version of [HuggingFaceH4/mistral-7b-sft-beta](https://huggingface.co/HuggingFaceH4/mistral-7b-sft-beta) on the RedMist137/Temp_AIHF dataset.
 ## Model description

 ---
 tags:
 - trl
 - dpo
 - generated_from_trainer
 model-index:
 - name: DPO-Zephyr-7B
   results: []
 # DPO-Zephyr-7B
+This model was trained from scratch on the None dataset.
 ## Model description

all_results.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
-    "epoch": 0.9993235625704623,
     "total_flos": 0.0,
-    "train_loss": 0.6513021779835009,
-    "train_runtime": 4537.7195,
-    "train_samples": 35474,
-    "train_samples_per_second": 7.818,
-    "train_steps_per_second": 0.061
 }

 {
+    "epoch": 0.9935483870967742,
     "total_flos": 0.0,
+    "train_loss": 0.6444497665801605,
+    "train_runtime": 1967.4135,
+    "train_samples": 9919,
+    "train_samples_per_second": 5.042,
+    "train_steps_per_second": 0.039
 }

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "HuggingFaceH4/mistral-7b-sft-beta",
   "architectures": [
     "MistralForCausalLM"
   ],
@@ -21,6 +21,6 @@
   "tie_word_embeddings": false,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.40.2",
-  "use_cache": true,
   "vocab_size": 32000
 }

 {
+  "_name_or_path": "/root/AIHF/IRL_Alignment_Project-master/AIHF_7B_code/data/AIHF_Mixed/checkpoint-200",
   "architectures": [
     "MistralForCausalLM"
   ],
   "tie_word_embeddings": false,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.40.2",
+  "use_cache": false,
   "vocab_size": 32000
 }

model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f5f179c427e1375727182574bc3a342b4caad2d6cd5f2069b3aa86ec52532fd4
 size 4943162336

 version https://git-lfs.github.com/spec/v1
+oid sha256:9fafee4b8739243c35d15355c24548d39ea759f55ef9dfea2fd40e2b5189d2f6
 size 4943162336

model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9b624de65f7536e9d6ea0c2e0d27e3c8780ff109ea9e023473b5b772cefe86af
 size 4999819336

 version https://git-lfs.github.com/spec/v1
+oid sha256:9adf34d53485e3101610c90c2dceae5a7fb3097c0ed89b64b22016df3b498e84
 size 4999819336

model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0e548873358e30afd5304f376998d717d3306d3673b19b984a392ff738ead249
 size 4540516344

 version https://git-lfs.github.com/spec/v1
+oid sha256:dacde297b3b77825ebb9a21c1e9850b2c1e61f51fa37be38b6bff65c7752137a
 size 4540516344

train_results.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
-    "epoch": 0.9993235625704623,
     "total_flos": 0.0,
-    "train_loss": 0.6513021779835009,
-    "train_runtime": 4537.7195,
-    "train_samples": 35474,
-    "train_samples_per_second": 7.818,
-    "train_steps_per_second": 0.061
 }

 {
+    "epoch": 0.9935483870967742,
     "total_flos": 0.0,
+    "train_loss": 0.6444497665801605,
+    "train_runtime": 1967.4135,
+    "train_samples": 9919,
+    "train_samples_per_second": 5.042,
+    "train_steps_per_second": 0.039
 }

trainer_state.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ffe33828a9d95943556f7447e05c9e5aa5a55fbf161b8aabd3cfce9a619b452f
 size 6264

 version https://git-lfs.github.com/spec/v1
+oid sha256:bb119e4359a89a9bd0ada30f62936f3daa245d3bfcbca9ffe14a1454a531247f
 size 6264