mistral-dpo-peft / README.md
jasperyeoh2's picture
Create README.md
cae8908 verified
|
raw
history blame
137 Bytes
metadata
datasets:
  - jasperyeoh2/pairrm-preference-dataset
  - GAIR/lima
base_model:
  - mistralai/Mistral-7B-Instruct-v0.2
tags:
  - PEFT
  - DPO