Update README.md
Browse files
README.md
CHANGED
|
@@ -15,8 +15,7 @@ tags:
|
|
| 15 |
|
| 16 |
- [HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep](https://huggingface.co/HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep)のAdapterをマージしたモデル
|
| 17 |
- This model is a fine-tuned version of [HachiML/Mistral-7B-v0.3-m1-lora](https://huggingface.co/HachiML/Mistral-7B-v0.3-m1-lora) on following datasets.
|
| 18 |
-
- [HachiML/
|
| 19 |
-
- [HachiML/oasst1_for_self-rewarding_EFT_MSv0.3](https://huggingface.co/datasets/HachiML/oasst1_for_self-rewarding_EFT_MSv0.3)
|
| 20 |
|
| 21 |
|
| 22 |
## Model Details
|
|
|
|
| 15 |
|
| 16 |
- [HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep](https://huggingface.co/HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep)のAdapterをマージしたモデル
|
| 17 |
- This model is a fine-tuned version of [HachiML/Mistral-7B-v0.3-m1-lora](https://huggingface.co/HachiML/Mistral-7B-v0.3-m1-lora) on following datasets.
|
| 18 |
+
- [HachiML/self-rewarding_AIFT_MSv0.3_lora](https://huggingface.co/datasets/HachiML/self-rewarding_AIFT_MSv0.3_lora)(split=AIFT_M1)
|
|
|
|
| 19 |
|
| 20 |
|
| 21 |
## Model Details
|