HachiML
/

Mistral-7B-v0.3-m2-lora

Text Generation

text-generation-inference

Model card Files Files and versions

HachiML commited on May 28, 2024

Commit

ffe9445

·

verified ·

1 Parent(s): 8a204c3

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -15,8 +15,7 @@ tags:
  - [HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep](https://huggingface.co/HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep)のAdapterをマージしたモデル
  - This model is a fine-tuned version of [HachiML/Mistral-7B-v0.3-m1-lora](https://huggingface.co/HachiML/Mistral-7B-v0.3-m1-lora) on following datasets.
-   - [HachiML/oasst1_for_self-rewarding_IFT](https://huggingface.co/datasets/HachiML/oasst1_for_self-rewarding_IFT)
-   - [HachiML/oasst1_for_self-rewarding_EFT_MSv0.3](https://huggingface.co/datasets/HachiML/oasst1_for_self-rewarding_EFT_MSv0.3)
 ## Model Details

  - [HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep](https://huggingface.co/HachiML/Mistral-7B-v0.3-dpo-lora_sr_m2_lr1e-05_2ep)のAdapterをマージしたモデル
  - This model is a fine-tuned version of [HachiML/Mistral-7B-v0.3-m1-lora](https://huggingface.co/HachiML/Mistral-7B-v0.3-m1-lora) on following datasets.
+   - [HachiML/self-rewarding_AIFT_MSv0.3_lora](https://huggingface.co/datasets/HachiML/self-rewarding_AIFT_MSv0.3_lora)(split=AIFT_M1)
 ## Model Details